FORM PTO-1390 (Rev. 1/98) (adapted)

U.S. DEPARTMENT OF COMMERCE - PATENT AND TRADEMARK OFFICE
TRANSMITTAL LETTER TO THE UNITED STATES
DESIGNATED/ELECTED OFFICE (DO/EO/US)
CONCERNING A FILING UNDER 35 U.S.C. 371



ATTORNEY DOCKET NUMBER 
ABR 022 US 



U.S. APPLICATION NO. (if known, see 37 CFR 1.5)



INTERNATIONAL APPL'N NO. 
PCT/SE00/01079 



INTERNATIONAL FILING DATE 
26 MAY 2000 



PRIORITY DATE CLAIMED 
28 MAY 1999 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other 
information: 

1. X This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. X This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay
examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and
39(1).

4. X A proper Demand for International Preliminary Examination was made by the 19th month from the earliest 

claimed priority date. 

5. X A copy of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. X is transmitted herewith (required only if not transmitted by the International Bureau). 

b. X has been transmitted by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 

6. □ A translation of the International Application into English (35 U.S.C. 371(c)(2)).

7. X Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)).

a. X are transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ have been transmitted by the International Bureau. 

c. □ have not been made and will not be made.

8. □ A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. X An oath or declaration of the inventors (unexecuted) (35 U.S.C. 371(c)(4)). 

10. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 

U.S.C. 371(c)(5)) 
Items 11. to 18. below concern document(s) or information included: 

11. □ An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is
included.

13. X A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

15. □ A change of power of attorney and/or address letter. 

16. □ Formal drawings

17. X Return receipt transmittal (MPEP §503) (specifically itemized). 

18. □ Other items of information:



Form 1390 (XMTL) Rev. 9/98 



Page 1 of 2 



Rec'd PCT/PTO 26 NOV 2001



U.S. APP 



INT'L APPL'N NO. 

PCT/SE00/01079 



ATTORNEY DOCKET NO. 

ABR 022 US 



19. X The following fees are submitted:
CALCULATIONS    PTO USE ONLY
BASIC NATIONAL FEE (37 CFR 1.492(a)(1)-(5)):

Neither international preliminary examination fee (37 CFR 1.482) nor
international search fee (37 CFR 1.445(a)(2)) paid to USPTO and International
Search Report not prepared by the EPO or JPO  $1040.00

International preliminary examination fee (37 CFR 1.482) not paid to USPTO
but International Search Report prepared by the EPO or JPO  $890.00

International preliminary examination fee (37 CFR 1.482) not paid to USPTO
but international search fee paid to USPTO (37 CFR 1.445(a)(2))  $740.00

International preliminary examination fee (37 CFR 1.482) paid to USPTO
but all claims did not satisfy provisions of PCT Article 33(1)-(4)  $710.00

International preliminary examination fee (37 CFR 1.482) paid to USPTO
and all claims satisfied provisions of PCT Article 33(1)-(4)  $100.00

ENTER APPROPRIATE BASIC FEE AMOUNT =  $ 1,040.00



Surcharge of $130.00 for furnishing oath or declaration later than □ 20
X 30 months from the earliest claimed priority date (37 CFR 1.492(e)).  $ 130.00



CLAIMS          NUMBER FILED    NUMBER EXTRA    RATE
Total claims    18 - 20 =       0               x $18.00    $
Ind. claims     12 - 3 =        9               x $84.00    $ 756.00



MULTIPLE DEPENDENT CLAIM(s) (if applicable)  + $280.00  $

TOTAL OF ABOVE CALCULATIONS =  $

Reduction of 1/2 for filing by small entity, if applicable. A Small Entity
Statement must also be filed (Note 37 CFR 1.9, 1.27, 1.28)  $

SUBTOTAL =  $ 1,926.00

Processing fee of $130.00 for furnishing the English translation later than □ 20
□ 30 months from the earliest claimed priority date (37 CFR 1.492(f)).  $

TOTAL NATIONAL FEE =  $ 1,926.00

Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment
must be accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00
per property  +  $

TOTAL FEES ENCLOSED =  $ 1,926.00



Amount to be refunded:  $
Amount to be charged:  $
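The arithmetic in item 19 can be checked with a short script. The following is a minimal sketch in Python; the function name `national_fee` and its parameter names are illustrative assumptions, and the rates are simply those printed on this form, not current USPTO fees. The translation processing fee and the assignment recording fee are left out because neither is charged here.

```python
from decimal import Decimal


def national_fee(basic_fee, total_claims, independent_claims,
                 late_oath=False, multiple_dependent=False, small_entity=False):
    """Recompute the national fee from the rates printed on the form."""
    extra_total = max(total_claims - 20, 0)        # claims in excess of 20
    extra_indep = max(independent_claims - 3, 0)   # independent claims in excess of 3
    fee = Decimal(basic_fee)
    if late_oath:
        fee += Decimal("130.00")                   # surcharge for late oath or declaration
    fee += extra_total * Decimal("18.00")
    fee += extra_indep * Decimal("84.00")
    if multiple_dependent:
        fee += Decimal("280.00")
    if small_entity:
        fee /= 2                                   # reduction of 1/2 for small entity
    return fee


# Figures from this transmittal: basic fee $1,040.00, 18 total claims,
# 12 independent claims, oath furnished late.
total = national_fee("1040.00", total_claims=18, independent_claims=12, late_oath=True)
print(total)  # 1926.00
```

With these inputs the nine extra independent claims account for $756.00, which together with the basic fee and the surcharge gives the $1,926.00 entered on the form.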



a. □ A check in the amount of $ to cover the above fees is enclosed.

b. X Please charge my Deposit Acct. No. 50-0828 (PowderJect Tech.) in the amount of $1,926.00

to cover the above fees. 

c. X The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Acct. No. 50-0828 (PowderJect Tech.) . 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 1.137(a) or (b))
must be filed and granted to restore the application to pending status.



SEND ALL CORRESPONDENCE TO: 

Thomas P. McCracken 

PowderJect Technologies, Inc. 
6511 Dumbarton Circle
Fremont, CA 94555 





Signature 

Thomas P. McCracken, Reg. No. 38,548 

Tel. 510-742-9700, ext. 209 
Fax 510-742-9720






Rec'd PCT/PTO 08 JUL 2002



Patent 

Attorney's Docket No. 033267-011



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Patent Application of 



Anders FOLKESSON et al.



Application No.: 09/926,59 
Filed: November 26, 2001 



For: FIMBRIAL PROTEINS 




Group Art Unit: 



Examiner: 



Confirmation No. : 6671 



SUPPLEMENTAL PRELIMINARY AMENDMENT



Assistant Commissioner for Patents
Washington, D.C. 20231

Sir: 

Further to the Preliminary Amendment filed on November 26, 2001, and in
response to the Notification of Missing Requirements Under 35 U.S.C. 371 mailed on
January 7, 2002, please amend the above-identified application as follows:



IN THE SPECIFICATION:

In compliance with 37 C.F.R. § 1.823(a), please substitute the attached paper copy 
of the Sequence Listing for the originally filed Sequence Listing on pages 14-66 of the 
application. 

Please replace the paragraph at page 4, lines 3-9 with the following paragraph: 
--Figure 2: Schematic representation of the pTY52 cosmid comprising the tcf-operon
SEQ ID NO:6.
A tcf specific PCR fragment of 11105 bp was cloned into the Expand vector I cosmid
(Roche). The insert is represented with a thick black line while vector sequences are
represented with thin lines. Relevant restriction sites sequences are indicated. The
position of the tcf-operon, i.e. tcf A, B, C and D (SEQ ID NO:6) is represented by a
shaded arrow.--

Please replace the paragraph at page 4, lines 15-21 with the following paragraph: 
--Sequence listing

SEQ ID NO:1 - DNA sequence of the genes encoding the precursor of the saf fimbrie unit
of Salmonella enterica subspecies I.

SEQ ID NO:6 - DNA sequence of the genes which encode the precursor of the tcf fimbrie
unit of Salmonella enterica subspecies I serovar Typhi.--

Please replace the paragraph at page 4, lines 23-29 with the following paragraph: 
--Deposit information

The phages carrying the inserted SEQ ID NO:1, i.e. phages clones B1, D1, F11 and N10
(see Figure 1) have been given the ECACC Accession numbers 99051922, 99051923,
99051924, and 99051925, respectively.

The cosmide carrying the inserted SEQ ID NO:6, i.e. cosmide pTY52 (see Figure 2) has
been given the ECACC Accession number 99051926.--

IN THE CLAIMS:

Please amend the claims as follows (all pending claims are reproduced for the 
Examiner's convenience): 

1. (Amended) A peptide encoded by a nucleotide sequence selected from SEQ ID
NO:1 and SEQ ID NO:6 for use in medicine.

2. (Amended) Antibodies directed against a peptide encoded by a nucleotide
sequence selected from SEQ ID NO:1 and SEQ ID NO:6 for use in medicine.

3. (Amended) A nucleotide sequence selected from SEQ ID NO:1 and SEQ ID
NO:6 for use in medicine.

4. (Amended) A vaccine for the protection against diseases caused by Salmonella 
enterica subspecies I, comprising a peptide encoded by a nucleotide sequence selected from 
SEQ ID NO: 1, or antibodies directed against said peptide and, optionally, a 
pharmaceutically acceptable carrier. 

5. (Amended) A vaccine for the protection against diseases caused by Salmonella 
enterica subspecies I serovar Typhi, comprising a peptide encoded by a nucleotide sequence 
selected from SEQ ID NO: 6 or antibodies directed against said peptide and, optionally, a 
pharmaceutically acceptable carrier.

6. (Amended) A nucleic acid vaccine for the protection against diseases caused by
Salmonella enterica subspecies I, comprising a nucleic acid sequence selected from SEQ ID
NO:1 and, optionally, a pharmaceutically acceptable carrier.

7. (Amended) A nucleic acid vaccine for the protection against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising a nucleic acid sequence 
selected from SEQ ID NO: 6 and, optionally, a pharmaceutically acceptable carrier. 

8. (Amended) A vector vaccine for the protection against diseases caused by 
Salmonella enterica subspecies I, comprising a host in which a recombinant vector 
comprising a nucleic acid sequence selected from SEQ ID NO:1, has been inserted and,
optionally, a pharmaceutically acceptable carrier. 

9. (Amended) A vector vaccine for the protection against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising a host in which a recombinant 
vector comprising a nucleic acid sequence selected from SEQ ID NO:6, has been inserted 
and, optionally, a pharmaceutically acceptable carrier. 

10. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 4 to said 
mammal.

11. A method for vaccinating a mammal against diseases caused by Salmonella
enterica subspecies I serovar Typhi, comprising administering a vaccine according to claim 
5 to said mammal. 

12. (Amended) Antibodies directed against a peptide encoded by a nucleotide 
sequence selected from SEQ ID NO:1, and from SEQ ID NO:6, for use in a diagnostic
method. 

13. (Amended) Peptide encoded by a nucleotide sequence selected from SEQ ID
NO:1, and SEQ ID NO:6, for use in a diagnostic method.

14. (Amended) Primers for, or probes that hybridize with, a nucleotide sequence 
selected from SEQ ID NO: 1 and SEQ ID NO:6, for use in a diagnostic method for the 
purpose of detecting Salmonella enterica subspecies I. 

15. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 6 to said 
mammal. 

16. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 8 to said 
mammal.

17. A method for vaccinating a mammal against diseases caused by Salmonella
enterica subspecies I serovar Typhi, comprising administering a vaccine according to claim
7 to said mammal.

18. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 9 to said
mammal.

REMARKS

Entry of the foregoing and prompt and favorable consideration of the subject 
application, in light of the following remarks, are respectfully requested. 

By the foregoing Supplemental Preliminary Amendment, the specification has been 
amended to replace the originally filed Sequence Listing with the attached copy of the 
Sequence Listing, which was filed in parent Application No. PCT/SE00/01079. Although 
the presently filed Sequence Listing is believed to contain material identical to combined
Sequence Listings 1 and 2 as originally filed with the present specification, this submission 
is being made in accordance with the statement pursuant to 37 C.F.R. § 1.821(f), which is 
submitted herewith. No new matter is believed to have been added. 

In addition, all references to SEQ ID NO:2 in the specification and claims have been
amended to recite "SEQ ID NO:6". The basis for this amendment is the following: The
specification as originally filed was filed with two sequence listings, Sequence Listing No.
1, containing SEQ ID NOS:1-5, and Sequence Listing No. 2, containing an additional SEQ
ID NOS:1-6 (i.e., which were not duplicative of SEQ ID NOS:1-5 of Sequence Listing No.
1).

In the application as filed, sequence number 1 of Sequence Listing number 2 was
referred to as SEQ ID NO 2. That only sequence no. 1 of Sequence Listing no. 2 could
have been intended by this indication is clear from a comparison of page 4, line 4 of the
description, where SEQ ID NO 2 is identified as the tcf-operon, versus the information
given on page 36 of the Sequence Listing part under items <223> referring to sequence
no. 1 of Sequence Listing no. 2. Namely, under these items, it is stated that the sequence
no. 1 of Sequence Listing no. 2 is the nucleic sequence of the tcf A, B, C, and D putative
fimbrial subunits respectively and of the tinR putative transcriptional regulator, i.e., is the
sequence of the tcf operon. Therefore, it would have been obvious to anyone studying the
application as filed that SEQ ID NO:2 referred to sequence no. 1 of Sequence Listing no.
2. Due to renumbering of the sequences, as indicated herein above, this is now sequence
number 6, and should consequently be referred to as SEQ ID NO:6.
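
The renumbering described above can be illustrated with a short sketch. It assumes, as the remarks state, that Sequence Listing No. 1 contributed five sequences and Sequence Listing No. 2 contributed six, and that the combined listing numbers them consecutively in that order; the function name `combined_seq_id` is hypothetical and is used here only for illustration.

```python
# Number of sequences in each originally filed listing, per the remarks.
LISTING_SIZES = {1: 5, 2: 6}


def combined_seq_id(listing_no, seq_no):
    """Map (listing number, sequence number) to its SEQ ID NO in the combined listing."""
    if listing_no not in LISTING_SIZES:
        raise ValueError(f"unknown sequence listing: {listing_no}")
    if not 1 <= seq_no <= LISTING_SIZES[listing_no]:
        raise ValueError(f"listing {listing_no} has no sequence {seq_no}")
    # Offset by the sizes of all earlier listings.
    offset = sum(size for no, size in LISTING_SIZES.items() if no < listing_no)
    return offset + seq_no


print(combined_seq_id(1, 1))  # 1
print(combined_seq_id(2, 1))  # 6
```

Under these assumptions the last sequence of Sequence Listing No. 2 maps to SEQ ID NO:11, which agrees with the eleven sequences declared under <160> in the combined listing.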

To further demonstrate that only this sequence could have been intended, it may be 
noted that in Sequence Listing no. 2 of the application as filed, only the first sequence was 
a nucleotide acid sequence, the others being protein sequences.

Moreover, on page 4, lines 16-21, the sequences referred to are DNA sequences. 
The only DNA sequences appearing in the sequence listing part are present SEQ ID NO:l 
and SEQ ID NO:6, since all the other sequences are protein sequences. A comparison of 
the description of the two sequences on page 4, lines 16-21 versus the entries made under 
items <223> on page 1 and 36, respectively of the Sequence Listing part clearly shows 
that the sequence described on page 4, lines 16-18 is SEQ ID NO: 1 and the sequence 
described on page 4, lines 19-21 is SEQ ID NO:6.

In the event that there are any questions relating to this Supplemental Preliminary
Amendment, or the application in general, it would be appreciated if the Examiner would 
telephone the undersigned attorney concerning such questions so that prosecution of this 
application may be expedited. 

Respectfully submitted, 

Burns, Doane, Swecker & Mathis, L.L.P. 




Sharon E. Crane, Ph.D. 
Registration No. 36,113 



P.O. Box 1404 

Alexandria, Virginia 22313-1404 
(703) 836-6620 



Date: July 8, 2002

Attachment to Supplemental Preliminary Amendment filed July 8, 2002

Mark-up of Specification 

Paragraph at page 4, lines 3-9 

Figure 2: Schematic representation of the pTY52 cosmid comprising the tcf-operon
[(SEQ ID NO 2)] SEQ ID NO:6.

A tcf specific PCR fragment of 11105 bp was cloned into the Expand vector I cosmid
(Roche). The insert is represented with a thick black line while vector sequences are
represented with thin lines. Relevant restriction sites sequences are indicated. The
position of the tcf-operon, i.e. tcf A, B, C and D [(SEQ ID NO 2)] (SEQ ID NO:6) is
represented by a shaded arrow.

Paragraph at page 4, lines 15-21 

Sequence listing 

[SEQ ID NO 1] SEQ ID NO:1 - DNA sequence of the genes encoding the precursor of the
saf fimbrie unit of Salmonella enterica subspecies I.

[SEQ ID NO 2] SEQ ID NO:6 - DNA sequence of the genes which encode the precursor of
the tcf fimbrie unit of Salmonella enterica subspecies I serovar Typhi.

Paragraph at page 4, lines 23-29 

Deposit information 

The phages carrying the inserted [SEQ ID NO 1] SEQ ID NO:1, i.e. phages clones B1,
D1, F11 and N10 (see Figure 1) have been given the ECACC Accession numbers
99051922, 99051923, 99051924, and 99051925, respectively.

The cosmide carrying the inserted [SEQ ID NO 2] SEQ ID NO:6, i.e. cosmide pTY52 (see
Figure 2) has been given the ECACC Accession number 99051926.

Mark-up of Claims 

1. (Amended) [Peptide] A peptide encoded by a nucleotide sequence selected from
[Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1 and [Sequence Listing No. 2 (SEQ
ID NO 2)] SEQ ID NO:6 for use in medicine.

2. (Amended) Antibodies directed against a peptide encoded by a nucleotide
sequence selected from [Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1 and
[Sequence Listing No. 2 (SEQ ID NO 2)] SEQ ID NO:6 for use in medicine.

3. (Amended) [Nucleotide] A nucleotide sequence selected from [Sequence Listing
No. 1 (SEQ ID NO 1)] SEQ ID NO:1 and [Sequence Listing No. 2 (SEQ ID NO 2)] SEQ
ID NO:6 for use in medicine.

4. (Amended) A vaccine for the protection against diseases caused by Salmonella
enterica subspecies I, comprising a peptide encoded by a nucleotide sequence selected from
[Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1, or antibodies directed against said
peptide and, optionally, a pharmaceutically acceptable carrier.

5. (Amended) A vaccine for the protection against diseases caused by Salmonella
enterica subspecies I serovar Typhi, comprising a peptide encoded by a nucleotide sequence
selected from [Sequence Listing No. 2 (SEQ ID NO 2)] SEQ ID NO:6 or antibodies
directed against said peptide and, optionally, a pharmaceutically acceptable carrier.

6. (Amended) A nucleic acid vaccine for the protection against diseases caused by
Salmonella enterica subspecies I, comprising a nucleic acid sequence selected from
[Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1 and, optionally, a
pharmaceutically acceptable carrier.

7. (Amended) A nucleic acid vaccine for the protection against diseases caused by
Salmonella enterica subspecies I serovar Typhi, comprising a nucleic acid sequence
selected from [Sequence Listing No. 2 (SEQ ID NO 2)] SEQ ID NO:6 and, optionally, a
pharmaceutically acceptable carrier.

8. (Amended) A vector vaccine for the protection against diseases caused by
Salmonella enterica subspecies I, comprising a host in which a recombinant vector
comprising a nucleic acid sequence selected from [Sequence Listing No. 1 (SEQ ID NO 1)]
SEQ ID NO:1, has been inserted and, optionally, a pharmaceutically acceptable carrier.

9. (Amended) A vector vaccine for the protection against diseases caused by
Salmonella enterica subspecies I serovar Typhi, comprising a host in which a recombinant
vector comprising a nucleic acid sequence selected from [Sequence Listing No. 2 (SEQ ID
NO 2)] SEQ ID NO:6, has been inserted and, optionally, a pharmaceutically acceptable
carrier.

10. A method for vaccinating a mammal against diseases caused by Salmonella
enterica subspecies I, comprising administering a vaccine according to claim 4 to said
mammal.

11. A method for vaccinating a mammal against diseases caused by Salmonella
enterica subspecies I serovar Typhi, comprising administering a vaccine according to claim
5 to said mammal.

12. (Amended) Antibodies directed against a peptide encoded by a nucleotide
sequence selected from [Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1, and from
[Sequence Listing No. 2 (SEQ ID NO 2)] SEQ ID NO:6, for use in a diagnostic method.

13. (Amended) Peptide encoded by a nucleotide sequence selected from [Sequence
Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1, and [Sequence Listing No. 2 (SEQ ID NO
2)] SEQ ID NO:6, for use in a diagnostic method.

14. (Amended) Primers for, or probes that hybridize with, a nucleotide sequence
selected from [Sequence Listing No. 1 (SEQ ID NO 1)] SEQ ID NO:1 and [Sequence
Listing No. 2 (SEQ ID NO 2)] SEQ ID NO:6, for use in a diagnostic method for the
purpose of detecting Salmonella enterica subspecies I.

15. A method for vaccinating a mammal against diseases caused by Salmonella
enterica subspecies I, comprising administering a vaccine according to claim 6 to said
mammal.

16. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 8 to said 
mammal. 

17. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I serovar Typhi, comprising administering a vaccine according to claim 
7 to said mammal. 

18. A method for vaccinating a mammal against diseases caused by Salmonella 
enterica subspecies I, comprising administering a vaccine according to claim 9 to said 
mammal.

Attorney's Docket No. 033267-011



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE



BOX: SEQUENCE 



Group Art Unit: 



Examiner: 



Confirmation No.: 6671



DECLARATION PURSUANT TO 
37 C.F.R. 1.821-.825 



Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

I, Sharon E. Crane, declare as follows: 

1. That the content of the paper copy of the sequence listing and
the computer readable copy of the sequence listing submitted
in accordance with 37 C.F.R. § 1.821(c) and (e),
respectively, are the same in compliance with 37 C.F.R.
§ 1.821(f).

2. That the submission, filed in accordance with 37 C.F.R.
§§ 1.821(g)[or (h)] & 1.825(a), herein does not include new
matter [or go beyond the disclosure in the international
application].

I hereby declare that all statements made herein of my own knowledge are true and
that all statements were made on information and belief and are believed to be true; and
further that these statements were made with the knowledge that willful false statements and
the like so made are punishable by fine or imprisonment, or both, under Section 1001 of
Title 18 of the United States Code and that such willful false statements may jeopardize the
validity of the application or any patent issued.




July 8, 2002
Date                Sharon E. Crane, Ph.D.
                    Reg. No. 36,113



(13/97)

SEQUENCE LISTING

<110> FOLKESSON, Anders; NORMARK, Staffan; LOFDAHL, Sven

<120> Fimbrial proteins

<130> ABR 022 US

<140> PCT/SE00/01079
<141> 2000-05-26

<150> SE/9901961-4
<151> 1999-05-28

<160> 11

<170> PatentIn Ver. 2.1

<210> 1
<211> 46870
<212> DNA
<213> Salmonella typhimurium

<220>
<221> CDS
<222> (37368)..(37868)
<223> safA putative fimbrial subunit

<220>
<221> CDS
<222> (37952)..(38689)
<223> safB putative periplasmic chaperone

<220>
<221> CDS
<222> (38713)..(41223)
<223> safC putative outer membrane usher

<220>
<221> CDS
<222> (41245)..(41715)
<223> safD putative fimbrial subunit



<400> 1 
gatacaaatc tcagggtgtt tttatacatc ctgtgaagta aaaaaaaccg tatcactgta 60
aaagggatac ggtttttttt cgtcttcaag aagttccacc gtctatcgtg gaatctggcg 120
caaatgggcc tacgcctgga tgacgaacag gatattaccg ccacttcttt cactgtcatg 180
gctattttga tcccactgac atttaaggcg cggcctcatg gcggtgctta accgggatcg 240
ggacatgttc agcgcagaag cagactgcgt aatgttgata tcactcagat aattacggag 300
aaccgccaga catgcgcatc atcactccag ggcatcccac ttctccagca actccaccgg 360
gatctcattg atcacctccg agaaccgttt tcccaccagt ctttcagcct ggcgtaacag 420
tgggatggtc gggctactgg gttcactgct ctcaaaccag cgacgaatgc accgcaggcg 480



ttccagcgca tcattacgat cgcgaattgg ccccggttca gcattgtgtg ggagggatat 540 
accgggtggc gcgttgacag gcggcacatc ctctgcccgt gccggaacgg cattatccat 600
aaccgtgcct gcggggtctg gcactggcgc tggcgacgga gggatttccg gtgtggttgt 660
ctgtaccacg tcaggcagca gcgctaacaa ctgccgcagg cgtgaaaaat ctggagccag 720
atcgcctaat gtctcacgcg cccacacttg caggcgttct gcgctttcct gcgcttcgcg 780
aaacgccgcc agcggtaatg ccccacgggc ttcgaggtcg gccagttgct ggcggacgga 840 
ttcgggggcc agcgcatcag ccgggcgggg agcggataac gcgcgctcca catcccgtac 900
ctgtaggcgc agggcggcac tgtttgacag cgtaataccg cgaatgtccg ccatcacgcc 960 
ttcgtgatcc agcagcgccg ccagggcatt actgcgcgcc agcgacgcat cttcaataga 1020
ttctgcggat gcgccttcgc cggtaagcag ctgtggatgg agtgcatcag accagattac 1080 
-gcFc^tct~gccagtt^ cctgctggat 1140 

gcgactgcgc agcagcagga tcaacacccg gatatcctta ctgcgcgtga gcagacggcg 1200 
cgcatcacgt tcaatttccg gccagttcac ggcttccggc gtactgacaa aatcaccata 1260 
ttgcgcctcc gcctgggggg cggcgcggct gaacagcagc aggtattccg gatcgtactc 1320 
cggatcgggg ccgcagggtt gttccgcgct taccggttta gtcagggaca tgtccatatc 1380 
attactctca gtgggttaag ccgtgttcag gttcaaaaat catgccggtc accggacggt 1440 
catgcatggc tttacctgcc caggccgtcc agccaagccg ctgcgcgccg ccgatcaccg 1500 
caggcggtgc ggcatgaggc ttcaactgca gctcaatttc ccagcaatat tcaaaaccga 1560 
tgaacgtacg caccagttct gtcagcacgg gcagattgtt gccgccgggc agaaaacgca 1620 
ggtaatcctc cagcgtgagc gggccgataa ttagccggaa tttgtactgc atatccggta 1680 
cggcctggcc gataagcgcc ccgttgccca gcaccgaaga ctgcgcgggc gtgcccagac 1740
gggtgatttc atcgttcgcc accgttatcc agtgcagggc gaattctttc accgcaaaag 1800 
gtacgctgaa atagtgcgcc agcgtggcgg ccagcccgtc aggattgcgc gattcgcgta 1860
ccagatgggc ggaggctgcc aggcgaacat gatctgacag cgggctttcg gcgctttccc 1920 
gtagatcctg cccgctgaga ctggcgatat aaaacgcaaa acggtcgtgt tccggtttgt 1980 
ccagcccgcc accagcggac tgggcgctgc gccatgcctg ccagaactgc gtcagccagc 2040
ggtggtgaaa aatattggaa aaatgaacca gcgtcggatc gtgacgactc tctgagcggt 2100 
tcagtgccag ctcggtatag tgcagcggca acgggccgtt tggcccccat agtccgaggc 2160 
tgtacaggct caggtgcagg cgtccatcct gccagctgac ctgggcgatt tcccgtggcg 2220 
caaaggtcat cgtcggcgtc tgtcccagac ggaatttttc catccgtggc ttgccagata 2280 



cttgcctgcc ggagtatcac agagctgggc atccacgcgc cgcatcaggt tcaggaatcc 2340
gtaacgccag ggggttttct gcgcctggtt gagagcgccg gtcatacgct cccccgttgc 2400
ccggttctga ccggccaggt catgacatgc ccgcgttgca tcgagtgcag cgtcatctgc 2460
gagaaggtat taatggaaac atggcgggca atatagtgtt ccagcaccag accgaacagg 2520
taaggactga taccggaaaa tccttcttcg tccacggtca gttcgcaact gacgccccgg 2580
ccatagacca acaggccgga gccgggcagg cggcgggtca ccggggtggt tttgcagcca 2640
atcaggctgc gcacctggcg cgactgcggg ctgtcgtgag ccgggataaa cagattcagc 2700
agatcgcgca gcgcctggcc gccggtgcgg tgatccagat cggccagcgg cagataatta 2760
aacgacaact gccggatcag ccgccaggcc atttcgcgtt cagccagcgg cggctgcggc 2820
gggcgcggcg gtctgataag acccacgccc gccaccggaa tcgccgcatc tacggtcaga 2880
tcatc^ggc^artacgtgg aataaggcag ggcagatcgc^ggt tagtcac cattgccgtg 2940
acggtgatat ggcgcagatt ttccgggtag ggcgcttcat gctgatcaac cagcgagagg 3000
aagacttccg agccggtata gggggttcgg gtgccatagc ggcgggcgtt ttctgacgag 3060
cggcgcggtt cacgacgcag tgaaaaataa cgcccgtggt tgccttcgtc attattacgg 3120
gtgtgataca gcggacgaaa aatcatcttc cgtgtggttt ccgcttccag gccttcgact 3180
tcctgaacag aaaacacctc gtaatccagc ggacgggtac gatccaccac cagatgctgt 3240
tccgtcacgc tgtgagtgac ttcaatccgg gtggtggtgc gaggaagcag gttgatcacc 3300
ggcgtacaga acaggctgaa ctgtgcagcg tccgtctgat gaatcagcca gtccggcggc 3360
aggcggttaa gcagtatgac aatttccgcc acattgccct gcaccttttg taacccggca 3420
gacaatccgg tcggggtgaa gaagtaaaac cgttccggac aggcgaaaaa ttcatgcagc 3480
agattatggc cgtgaaacac gttccaggcg agcggtagca gcccctgccc tggctccagc 3540
ccttcgtgcg ccaccgggtg ttgaagattc acattcagtfc cgccgtcaaa gtgaccgggt 3600
tcaccggcca gtgtggcgac ggcgctggta tgtagcagct caaacaggtg tgacgcaatg 3660
cgttcttcgc cgcagaggta aaagggcagc cgtgccggac cggccagctc gctgaaagtc 3720
agttccccga aggttcgcag ggtgatgcgc aatgccccgg cgacatgaat attaggcggc 3780
agatagcggt gcagggcggg catatccggc ggcgcggcgg tcaggcgtac ctcctcgatg 3840
gacagcggcc acagcgtgac gtcctggctg ctgcgaaact ggcaggcggt attttcgcct 3900
tccgggatgg gggaaacgaa cgcggtatcg cgcggcacgg tgaccccttt cgccaggtcg 3960
ccttcctgcg tatcgggata cagctttacc actgccatcg atggcgtggg ggtgacgtaa 4020
ttggggctga cgacttccag taaccgctgt gtgaagcggg gaaactcggc gtcaattttt 4080



agctgagtgc gggcgctcag aaagctgaac gcctcgatca tgcgttccac atacgggtcg 4140
gcaatatcgg ttccctgcat ccccagtcgg gcggcaattt tgggatggag ggtggcgaac 4200
tcagcaccgg tctcccgcag gtagctcagt tcgcggttgt aatactccag tagccgtgga 4260
tccatgaata atgccctgta ttaaaagaac gtaatgcggc tcagctccat atccagcgcg 4320
ctgcgtacca gaaactccgt gggatacggc tgcgtcagaa tttgtccgcg aatttcaaac 4380
tgtagcgtgt tatagctacc ctgccgggtt ttatcgagca agggcgtgac cctgagtgtg 4440
gcggcgttca gccggggttc gaaacggata atggcgcgcc ggatcgcctc gctgatatcg 4500
tcccacttat gctcatacat aaagctgccc gccagcggcg gcaggccata gttgagcact 4560
gacgccgccg cctgcggata gcgccgggcg tcgatgtcac cctcgtggct aatggtattg 4620
agcaaaaagg agagatcccg gcgaatgatc tccttcagtt gtaccggcgt gacgctgata 4680
t cccggt caa t tftt c tgat a~~cggagca 1 1 g tcacacagcc gatcaaacag cgtgggcagc 4740
aggtggttag cgggtgtaaa acgagacgtg ctcatgcgcc atcgttttcc tgagcatgaa 4800
aggtacaatg ggccatgtcc agcaggctga tatcgccgtg gctggtcagc cacactttct 4860
gccccagcgc ccgcacggtg gtttcgccgg ggccgtcctg ccaggcggtt tccctgcaca 4920
gacgcagggc gtcggatgca ctttccgaac cgctgtaacg ggtaaagagc caggcgccgt 4980
gcgtatcgcc attcaccagg gtgatattaa cgggtttcca cagcagatca gtcaggcgcg 5040
tcggttgcgg cgattccagc gagcgtattt gcgaaaacgg cagccagata tacacgccgc 5100
cggtgactag ctcaagtacc gggccaaggc gggaatcgct gtcgctcgcc cagtcaaatg 5160
cgccgccgtt ccactgcccg cccgtgtctg ttatggcttc cagtgcggta ttacggtgtt 5220
tatcaacctc accggtatcg tcatgacagg cgagtgccgc cagcagtgac tccacccaaa 5280
cgggctgcgg cagaagaaaa ccgggtcgtt gttcaccctg aaaaacggtg tggcggaaca 5340
tttcgcagcg aaccagctcc cggtacagcc gggcctcctg ggtataattg gcctccatcc 5400
tggcgcatag ctgaagctgg tgtagcgccc gcgaccagtc tccggccaca cacagcaact 5460
gaaacaggct gtggcggcag agcgctttcg ccggattttc ccgaacctgc tgctccgcca 5520
tctgaatccc ctccgcaata gagtattcct gtatcagcgc ggacagggta gcaggaagcg 5580
tgtcagtttt tttcatgggc ggtatttcca tttttctgtg tcggagtgat tcggtagtgg 5640
ctatcgatgc caataatacg atgctcgcgg cgggtcagat cgggcagtat ctcgctatgt 5700
gcggttttcc cccccagttc cggcgagagc agttgcagga tatcgggcat tgattccatc 5760
gccagccagt gcatttcacc ctcaccggtc gtgtccagcg tatccagaat ggcatcaata 5820
ccgggcgcgc ctgccaccat atcctgtaac gtgtcggtgt cgcccttttt atcgtataac 5880



gacgtcaaat cctgggtgac agcgtcttca tttcggggaa acggtttagc acggaggggc 5940
ttttcctgcc cggagggccg caatgcctgc tgatattcct ggtaaagctg gtgtagtgtg 6000
ctatccgctt catggccagt gtaagaggat gcggtttccc ggacgggaat aatatcgaac 6060 
gggttctgcg gattgagctg ttgttgttta aaccagtcca gatccaggta ctcagcaacg 6120 
gattcagatg actgaaccaa ctgaggaaac gaaacatcgg gacgggactc atcattggtg 6180 
cgggcaaggc accaggatga cagaccccat tcaatcgtat ctccatcatt cagccgcata 6240 
cggtgatgag gttccatcac ctgctcattg acggcacagc acagttcatc agagtgatta 6300
acaatccacc acgcctcttc gtggcgaaca agactgagcg cgatatcgtc tttcccatgt 6360 
tgctcaaaaa gcggaatgta gggggcggcg gcagtaaata cgatgtatgt ccctgcggga 6420
tactgtgcgt tatccgttat accctgcgac tgtattttcc gtaattccca catgtcttgt 6480 



ttcattatgt cccttaaatg tatttatttt tctggaggaa acgtttaggg agttttaatt 6540
cataaataat tttaactaaa tttataggga gtcattattg atgacaccct ttttattatt 6600 
ttcgtcgtga atgcattgtt gtatgcatag atgtcttttt tgaaatatta tttcttttaa 6660
ttctgcaatt gcgattttta tcttgctgtt cattgattat ttaatattaa cggcttagtt 6720 
gctaattagt ccctgatctg tatcattgtt ttgtttcgat attttttcga ggctatcaat 6780
aaagaattgt ttatatattt atatgcattg atgcattatt tttatgaatt tttatgtcac 6840 
aaggcataac acatggaaac tcctgtttca cgcagtgcgt tgtatggaaa actggccggc 6900 
ccactattcc ggtcgctgga atcggcaacg gcattttgca aactacgctc taatccctgg 6960
gttgagctga ctcactggct gcaccagtta acacagcagc ccgataacga tattctccac 7020
gttcttcggc attaccagat ccctctttct gatgtggaga aagcgttact ccggcaactg 7080 
gatatgctgc ccgccggggc cagcgccatt agtgattttt ctcaccatat cgatctcagc 7140 
gttgaaaagg cctggatgct ggcgagcgtc cgttacggcg ataacaaaat tcgcagcggc 7200 
tggttgctgc tggccttgtt gaccacgcca gaactgcgtc gggtactgag cagtatctgc 7260 
gcgccgctgg ccacgcttcc ggttgatgaa ctgacggaaa tactgccctc gttgatcgaa 7320 
acatcgccgg aagcgcagga gcgcccttac gacggctccg gcctggcatc agccattccc 7380 
ggtgaaagca gtcaggcgat tcccaacggc gggcaggacg gtaaatccgc gctggcaaaa 7440 
tactgtcagg acatgacggc acaggcgcgc gacggcaaaa tcgacccggt gacggggcgt 7500
gagcatgaaa tccgcaccat gacggatatt ctgctgcgcc gtcgccagaa taatccacta 7560
ctgactggtg aggcgggcgt cgggaaaacg gcggtcgtcg aaggttttgc cctcgcgatt 7620
gcgcaggggg aagtgccgcc cgcgctgcgg gaagtacggc tactggcgct ggacgttggc 7680 
gctctgttgg ccggagccag catgaaaggc gagtttgaat cgcgtctgaa agggttactg 7740
gaagaggccg ggcgctcgcc gcagccggtt attctgtttg ttgatgaagt tcacactctg 7800 
gtgggcgcgg gcggcgcatc cggcacgggc gatgccgcta acctgctgaa accggcgctg 7860 
gcgcgcggca ccctgcggac tatcggcgcc accacctgga gcgaatacaa gcgccatatt 7920 
gagaaagatc cggcgctgac ccgtcgtttt caggtgttgc agattgccga accggaagag 7980
atccccgcaa tggaaatggt gcgtggtctg gtggatacgc tggaaaaaca ccataacgta 8040
ctgattctgg atgaggcggt acgtgcggcg gtacagcttt ctcaccgcta cattcccgcc 8100 
cggcagttgc cggataaggc catcagcctg ctggataccg ccgcggcccg cgtggcgctg 8160 
acgctgcaca cgccgcctgc cagcgtacag ttcctgcgcc agcagctaaa agcggcggaa 8220 
atggaacggt cgctgttgca gaagcaggaa aaaatgggga ttcagtcaga tgagcggcgc 8280 
^itgcgctga tggcgclfaat" tttctcgctc Taca atgaa c _ t ga c t gcatc cgaatcccgcna 340 
tggcagcggg agctggaact ggtacatacg ttgcaggaac tgcgtctcgc agagtctgat 8400
gctgatgaca aaaccacgct gcaacaggcc gaaacggcgc taagggagtg gcagggcgac 8460 
gcgccggtgg tgttcccgga agtcagcgcg gcggttgtcg cggcgattgt cgccgactgg 8520 
accggtattc ctgctgggcg catggtgaaa gatgaggcca gccaggtgct ggaactgcct 8580
gcccgactgg cgcaacgcgt taccgggcaa gacggcgcgc tggcgcagat tggtgaacgt 8640 
attcagaccg ccagggcggg actgggcgat ccacgcaaac cggtgggcgt gtttatgctg 8700 
gccgggccgt ccggtgtcgg taaaaccgaa accgcgctgg cgctggcgga ggctatctac 8760 
ggcggcgagc agaacctggt aaccatcaat atgagcgagt tccaggaggc tcacaccgtt 8820 
tccacgctga aaggcgcgcc gcccggctat gtgggctatg gcgagggtgg tgtgctgacg 8880
gaagcggtgc gtcgccaccc ctggagcgta gtgctgctcg acgagatcga aaaagcgcac 8940
catgacgtcc acgaactctt ctatcaggtg tttgacaagg gtgggatgga ggacggcgag 9000 
ggtacacatg tcgatttcaa aaacaccacg ctattactca ccaccaacgt gggttccgac 9060 
ctcatcagcc agatgtgtga agatccggcc ttaatgcccg atgctacggg gcttaaagag 9120 
gcgctaatgc cggaattgcg caagcatttc ccggcggcat ttctgggccg cgtgacggtg 9180 
atcccttacc tgccgctgga cgaaacgtcg cgtggcgtga ttgcccgtct gcaccttgac 9240 
cggctggtgg cgcggatgag tgaacagcac ggcgtgacgc tgacgtatag cgaggaactg 9300 
gtcgcacata ttgtggcgtg ctgtccaatg catgaaacgg gcgcgcggtt gctgattggc 9360 
tacatcgaac agcacattct gccacgactg tcgcgctact ggttgcaggc catgacggaa 9420 
aaagccgcga tcaggcagat tgatatcggc gttaatggtg atgagcagat tgtttttgag 9480 



atcgtttgct gaaaccggcc gttcgaagtg tccgtagtgc gattttaaaa actgtaccgg 9540 
tataccgctc cccttgcggc aaccagttga ctaaaaagga aatgaaggat tatggctatc 9600 
aacaatagcg cgcagaaatt catcgcgcgc aaccgcgcgc cgcgcgtgca gattgaatat 9660 
gacgtagaga tttacggttc cgagaaaaaa atcgagctgc cgttcgtgat ggcggtgctg 9720 
gccgatctgg ccgggaaacc gcgtgaagaa ctgccgccgg tgacggatcg caaattcctc 9780 
gatattgata ttgataactt caatgagcgc atgaaagcca ttgcgccgcg cgtggcgttc 9840 
gctgtgccga atacgctgac gggtgaaggt cagttgatgg tcgatatcac gctggaaaat 9900
atggacgact tttcgccggc acagattgcc cgcaaggtgg acgccctgaa ccagttactg 9960 
gaagcccgca ctcaactggc gaacctccag acctacatgg atggcaaggc gggggcggaa 10020 
aatctggtca ataaactgtt gcaggacccg actctgctga aaacgctggc gaatgcgccg 10080 
Taalcggccg ctacccagcsTagatg tg t ca~ gcgga t Taa t g aat cagcgga ataacgtcga 10140 
atttttaagg aattttcatg gcaaacagta atatgcaggc aaccgacgcg gttgctcagg 10200 
ataccgcctc cgcatccggt gaatttgatg cgttgctgaa tcaggccttc cgacccaaga 10260 
ctacccaggc ggcaaaagcc gtggaagccg cggtgcagac gctggcgaac acgatcaccg 10320 
tcagcgatga cgcctataaa agcatcagcg ctattattgc gcagatcgac tttaaactga 10380 
ccgaacagat caaactgatc ctgcaacatc ccgactggca gaagctggaa tcctcgtggc 10440
gcggtatgga gcatctggtt tacaacaccg agaccgacga aaagctgaaa attcgcttca 10500 
tgaatctgtc aaaagatgaa ttgcggcgca acatgaagcg ttacaagggc atcgcctggg 10560 
atcaaagccc gatgttcaag aaactgtatg aagccgaata cggccagtta ggtggcgaac 10620 
cttatggctg tatcattgcg gattactact tcgaccatac accgcccgat gtggatctgc 10680 
ttggctctat cgccaaagtc gccgcgtcgg cccatgcgcc gtttattgcc ggggcttccc 10740 
cctcggtact gcaaatggac tcctggcagg aactggcgaa tccccgcgac ctgaccaaaa 10800 
tcgtcaccca gaacctggaa tatgcgccgt ggaactcgct gcgggctagc gaagactccc 10860 
gttatattgg cctgacgatg ccgcgttttc ttgcccgcct gccgtatggc gcaaaaacca 10920 
acccggtgga cgagtttgat tttgaagaag atgcggatgg ttctgaccat accaaatacg 10980 
tctggagcaa cgcggcctac gcgatgggcg taaacatcaa ccgttccttc aaacactacg 11040 
gctggtgtac gttgattcgc ggtgtggaat caggcggtgc ggtggaaaat cttccctgcc 11100 
ataccttccc gactgacgat ggcggcgtgg acatgaaatg cccgaccgaa atcgccatct 11160 
ctgaccgccg cgaggctgaa ctggcgaaaa acggttttat cccgttgatc caccgtaaaa 11220 
actcagacta tgccgccttt atcggcgcac agtcgctgca aaaaccacag gaatactacg 11280 



atccggacgc gacggccaac gctaacctgt ctgcccgtct accgtacctg ttcgcctgct 11340 
cgcgcttcgc tcacttcctc aaatgtatcg tccgcgacaa aatcggttcc tttaaagagc 11400 
gtgaggatat gcagcgctgg ctaaatgaat ggattatgaa ttatgtcgac gccgatccgg 11460 
tgaactcctc gcaagaaact aaagcccgtc gtccgctggc tgccgctgaa gtagtggtgg 11520 
aagaggtcga aggcaatcca ggttattacg acgcgaaatt cttcctgcgt ccgcatttcc 11580 
agcttgaagg gctgacggga tcgctgcgcc tggtgacaaa actgccgtca gtgaagcagg 11640 
gcaatgcctg atatatattt tgtgaatgtt taagcgagtg aagtcagaga agatagagaa 11700 
tataaagagg gatatgaaga aaagaatttc gtctcgccca cggtctcgta aaggtggggt 11760 
acgtaatgat gacacatatc cgaatgccag taacaatgcc gaagcttttt atatcattga 11820 
gtaggaaata catattatga ccataagccc aacttttcat ctgttacctg gtattgttct 11880 



^tc^ttca^aatatgctg tagcctggga "agtcagttgc^ccggctgtta ttgatactca 11940 
gtcttctgct gtgagcctga agtctgatgt cccagcggcg tggcagcttt ctccccgata 12000 
tatgtcgcgt ttatggttaa gtagtattgg ggtaacgcag ggtaaacctg aaaacctgat 12060 
ggatctcaaa ccagagacta aaaaagtaaa cggtgaaaat tggtctgtat gggaaacaga 12120 
acgtggtagc gataaagaaa ccgatcgcta ttgggtttcg tgtatttatg gtcatgaaca 12180 
gatatggttg acgcaaccaa tacctgcttc ttctactcgc tgtaagactc gtaattttga 12240 
gggatcgcca gaagaccagt ctgtatcttt tatctgtaat tagcgatttg agacgtgaaa 12300 
atttcagtac aggttatggt ttttattatc ggaagttatg aagcattatt tatatgcatt 12360 
aaataatgca aattcataaa ataactaaat acattatcgg taccggaaaa atatacagtc 12420 
ctctgttctc ctgaagttat tggagaagga ttctgtacgg caatgattta tctataaaca 12480 
aaaagatata gataaaatca ggtttatttt aagtaaaact taataaggat ataaaaatgg 12540 
cttatgacat ttttttgaaa attgacggca ttgatggcga gtcaatggat gacaaacaca 12600 
aaaatgaaat tgaagtactg agctggcgct ggaatattca tcaggaatcc accatgcacg 12660 
ccggtagcgg cctcggctcc ggtaaggtct ccgtcaccaa cctggatttt gatcactata 12720 
tcgaccgcgc cagcccgaac ctgttcaaat actgcgcctc cggcaagcac attccgcagg 12780 
ccattctggt tatgcgtaag gctggcggca atccgctgga gtacctcaag tataccttca 12840 
ccgacctgat tgtcgccgtg gtttccccga gcggcagcca cgatggtgaa atcgcctccc 12900 
gtgaaacggt ggagctctcc ttcagcaccg tgaagcagga atacgtggtg cagaaccagc 12960 
agggcggcag cggcggcacc atcaccgcag gctacgactt caaggccaac aaagaaattt 13020 
aacggctgtt tttccggcca gatgttatgt ctggctggtt ttattgtttt gattttaaag 13080 



gaatttacag tgaataaatg gcgtaacccc actgggtggt tatgtgcggt agctatgcct 13140 
tttgcactgc tcctgctttc cggatgcggc agtagcgatt cgctacttga cccctaatcg 13200 
cagcggcctg gcctgagcgt gaaagcgttt tacaaggtga attctgacaa tcagaagaaa 13260 
gcggcgtcca tgaagatacg tgttgagaat taatgaccta cacagaattt ttagaggtta 13320 
agcaaaatga acagaccttc attcaatgaa gcgtggttag cttttaggaa ggtgaatcat 13380 
tccgtcgctg atgtgggtag cattattggt ggaaacgttg ggaaaaatat aactggtggt 13440 
tattttcaaa atgcctgccc tattcgaatg agctatgttt tgaatgcgac agggttccca 13500 
atagcccgta actctccgta tgcaaaggtt agtggtgccg ataataaatt ctatatttat 13560 
cgcgtgaatg atatgattga ttatcttact catactatgg gcaagcctga tcttattgtt 13620 
aataatccga aacagagtga ctttatcggg aagaaaggaa ttatcgtagt aaaagggcat 13680
~~ggc t gga gc a atgcc a gagg acacg t tacaT~t t a t ggaa t g gcag t" a t c t g 1 1 cagatcag "1 374 0 ~ 
tgccacttat taaatgaccc agataatgga ccatttgttc ctgaagttgg gacactgtgg 13800 
atactgccgt gaaatggtta atattagtta ctttcagcat aagtggaatg ctggtatggc 13860 
agccatcctt tgcacaagaa gcattgacca cacaatattc acagtcggaa ctcttaaaaa 13920 
attgggcgct gagtcattgt ctggcattag tatacaaaga tgatgtcgtt aaaaacgatg 13980 
ccagagctac ggccagtgct taccttgaat atggtaaaca atctgtggag atttaccatg 14040 
aaattgatga gattgcgaaa aaatattcag ggttgaaata taacggttcg atatcatcag 14100 
attttaatac catgaagtgc atagatttta tccatgacag ggaattaaat gaattaatta 14160 
aaaggcgtgt cgagaagtaa aattcaaaga tattaaaagc atacgttctc ttgctctgat 14220 
gaatttatgg gtaagaaaga gctgtacagg aatagttaat ctgttcacct aataaagcag 14280 
ataaatcagg gcttaattta ggtagttaaa aggatagtag atatgtctta tgacattttt 14340 
ctgaaaattg acggcattga cggcgagtca atggatgaca aacacaaaaa tgaaattgaa 14400 
gtactgagct ggcgctggaa tattcatcag gaatccacca tgcacgccgg tagcggtctc 14460 
ggttccggta aggtctccgt cactaatctt tcatttgaac attacatcga tcgcgccagc 14520 
ccgaacctgt tcaaatactg ctcttccggt aagcacattc cgcaggccat tctggttatg 14580 
cgtaaggctg gcggcaatcc gctggagtac ctcaagtaca ctttcacaga tctgattatt 14640 
gcaatggtat cgcccagcgg aagccaggga ggggaaattg cgtctcgcga atcaattgaa 14700 
ctctccttca gcaccgtgaa gcaggaatac gtggtgcaga accagcaggg tggcagcggc 14760 
ggcaccatca ccgcaggcta cgacttcaag gccaacaaag aaatttaacg gctgtttttc 14820 
cggccagatt tatatctggc cggatttatt attttgattt taaaggaatt tacagtgaat 14880 



gaatggcgta accccactcg gtggttatgt gcggtagcta tgccttttgc actgctcctg 14940 
ctttccggat gcggcagtag cgatgcgtta cctgacctcg aatcacagcg actcgacctg 15000 
agcgtgaaag cctccgataa ggtgaatcct gacaatcaga agaaggccgc gcccattgag 15060 
atacgtgttt atgaactgaa aaatgacgcc gctttcacga cagctgatta ctggtcgctc 15120 
catgacaacg acaaatccgt ccttaccgac gatttagtgc gtcgcgacag ctttattttg 15180 
cgtcccggcg aagagaaaaa actgcgtcgc ccgctgaatg cgcagaccac ggcaatcggc 15240
gtactggccg gataccgtaa cctggccaaa tcggtctggc gggtaaccta caaaatcccg 15300
gaagccccgg aaaaagcctg gtacagcagc ttcatatcgg ggaaaggaaa agtgcagttg 15360
gaggcggaac tggaacaaag cgccattgta attacggaac gggataaatg aattatgagc 15420 
tggaatgacc gcgtagtctg gagtgaagga caatttttac tgccgcagat gtttcagcag 15480
caagagcgtt atctggaaca cgtcatgcat taccgcagcc tgccgctgac cccctttttc 15540 
tggggattca gccactacaa tattgatggc gaagcgctga acatcggtaa actgatactg 15600 
aaagaggcat cagggatttt tcctgacggc acgccgttta acgcaccgga ccacaccccg 15660 
ctgccgccgc cactgaccat tctgccggag cacctgaacc agcagatttg tctggcggta 15720 
ccggtacgcg cgccgaacag cgaagaaacc acgtttgaca ataacccgga atcattggcg 15780 
cgtttctcgg tacatgaaca cgacatccgc gacgccaact cgctgggacg tggcgcgcag 15840 
ttattacagc tcagtcatct gcgcctgcgg ctgctgccgg aaaaggcggt gacgggcgcc 15900 
tggattggcc tgccgttgac ccgcatcacc gggttgaacc ctgacgggcg gatagatatc 15960 
gaccacgacc tgatcccgcc catcattaat tatcaggcca gttcactgat gtgtacctgg 16020 
ctgtcgtgga tcaacgatct catccggatg cgggccgatt cgctggcgga acggctgacc 16080 
ggcagcgaca accacggcca tgaagcagcg gaggtctccg attacctgct gctgcaaatt 16140 
ctcaatcgct ttgagccgct gctgactcac ctggcgaaaa ccccgctggc cccggaggtg 16200 
ctgtaccgct acctgtccga actggccggg gaactctcca cctatgtgcg tccacaaacg 16260 
cgacggcccg ctgaatacaa agagtacaaa cacctgacgc cctatgccgg gttgaaatcg 16320 
ctggttgatg aggtgcagtt cctgctgaac gcggtactga tccggggcgc gcagcgcatc 16380 
gagctgaaag aggggactta cggcatcctg aatgcggtgg tggccccttc cgatcttgcc 16440 
gatttcagca cgctggtact ggcgataaag gcttcaatgc cgaccgatgt gctactgcaa 16500 
cattttgccg cccagaccaa aatcgggcca tccgatcgcc tgccggaact gatccgctcg 16560 
catctgccgg ggctggcttt gcaggttctg cctgtaccac cgcgccaaat cccgtttcag 16620 
gccggataca tctattacga catccgccgc gagggagcat tgtgggaaca cattgcccgt 16680 



tacggcggga tggccatgca taccgccggg gaatttccgg ggctggagac agaactgtgg 16740 
ggagtgcgcg ataaatgaca gacagtaccc tgacgccgcc agcggcggat atgatgtcct 16800
ttttgtccac cacgccggaa cataaggaca gtgaatatga aacgccggta cacaccagcc 16860 
agcgcacgga actcaatgtc atcgttgaag acggtccgga cagcaaactc cggctggctg 16920 
aaatcagcgc ggcggctaac ccgttgctcg ccgctgcccg gcctttattg tgcgctctcg 16980 
cagccatgcc cgctaaactg gatgcggccc tggtagagcc ttaccgtaat ctgctggtac 17040 
gcgagatgca tctgtaccag acattatgcg atcaggcgaa cctgcggcgc gagcacgtac 17100 
tggcggtacg ttactgcctg tgtacggcgc ttgatgaagc cgccaataac acaacctggg 17160 
gacggcgcgg cgtctgggcc ggaaaaagcc tgctggtaac atttcatggt gaaagcgaag 17220 
gcgggataaa acttttccag atcatcgggc gtctggcggc cagcttccag gagcatggca 17280
acgtactgga ggttatctac cacctgctgg ggttgggatt tgaaggccgc tacagcgtgc 17340
agccagacgg gcgtaagcaa ctggacaata ttcgccagca actgctgaca cagctttcac 17400 
agcgtcgcga tccggttatg cccgcgctct cgcctgactt tcagggggcg ataagcggac 17460 
gactgcggcg gatgcgccgg gtgccggtct ggctgagcgc cgggatagcc ctgttggcga 17520 
tgctgacgct gtttggcctt tacagccacc ggatggatgt gcagaccgtc accgtacaac 17580 
agcatattga tgcgattggt ataaaactgc cgccgccgcc tgtgccggtt cataagctgc 17640 
ggctgaaaat cctgctggca aacgaaatcg cccgtggcct gctgaccgtg gacgaagatg 17700 
accagcacag tagggtggtc ttccgtggcg acgccatgtt tgtgccggga cagaaaacgg 17760 
tgagtgacgc aatccggcca gtgattaaca aagcggcgcg ggaaatcgcc cgcgtgggcg 17820
gcgcagtcac tgtaacgggt cacactgaca gccagcccat tcattcggct gaattcccat 17880 
ccaacctggt actgtcggaa aaacgggcgg cggaagttgc ggcgttgctg acctccggcg 17940 
gcgtacctgc cggacgggta catatcgtcg gcaagggcga tacggtgccg gtggcggata 18000 
acggcagtaa agccgggcgg gcgaaaaacc gtcgggtgga aattctggta gtggagtgaa 18060 
tgaatgatga aaaaatcaac ctatgatgtg tctcatcatt cggcagtatg tggcgtgacg 18120 
ggggattatt atcggatctc agcgacatat cacataacac gatctgttcg tgtttttttg 18180 
atcatcttat gttgcctgtt atccggtggc gcttttgccg gatccccgat taacgcagga 18240 
ttcatttccc ccgataatgt caacctcagt actcaggatt tcctgaaatt ttatgccact 18300 
gacaacgtac agaaaaaaga caatgcactg atgtatatgc tgggggtgga ggatgcgaca 18360
gaaggtaaag cctggtgtgg atatggtcag gttgacagta taacaataaa ccatactgtg 18420 
ctgacctggt ttgaacagca cgcagtgaaa aagcctgatg taagggcttc aatactaata 18480 



gaggaagcat tagttaaaaa ttttccctgt cagaggacag actcctccat aaaaattgct 18540
tcccggtcat ctcccatttt atccctgacg ccggatgcgc ttaatctttc aggtaatgac 18600
ttttttaaat tttgggtgtc tggtaatcaa cgggataaac tcagggcggg tgtctatctg 18660
ctcggcgtgg aggatgcgac agagaacaaa ctgtggtgtg gatacgcttt atttaagacg 18720
ctaacattaa atgaattagt ctatgtttct cttaaaaata aaaccaatga ggaactgaat 18780
tctcgcgcgg ctgaacttat cataaataaa ttaatagagt atccctgtaa tatataaaat 18840
cattcaagtt gcatcaaggc ggcaagggag tgaatccccg ggagcgtaca ttagttcgtg 18900
actggggtga gcgaggaaag ccaacgcaca tgcagcctga agtatgacag gtataccctg 18960
tgataatgtt atcgctgcca gttcagataa agtcttggtg gtaaagtctt ggtggtaaga 19020
tattgatatg tttaaggtaa tggtctgtct gttggtgggt gttcctgcca tttcttatgc 19080
gcacgattat gggtgtgcta cggttggagc atcaatggag tcatcattat ttgatgccat 19140
aaaaaatgac ctgaatatag atgtcgctac tattataaag gataaaacaa aggtagagat 19200
tcttgatatc tcacctgtat ctaaggtcta tgcagaatct ctggccagga tggattatga 19260
aaaagataag gccaaaaata aagtggcaat attagataaa aaatcctatt tcgatagtta 19320
ctatgaaaat caggttaaat ctatcgtggc aaaatatacc tatattaaca aagataaaga 19380
aaaagatatt ttcattgcat ccagcttcat gaatgctgat gagtgttctg taagatttaa 19440
tggttatatt actttatcca gggaatttta aaatattgat tattgggtgg cgagtcgaaa 19500
tataatgcaa aaatttctta gtctgctttt ttcccggcgc gcgctggcag ttgtgggcgt 19560
tctggttctg gcgctgctgg tctggtttgt cgggccgctg gtgtcatttg ataccctgcg 19620
cccgctggcc tccgtgggta gccgggtagt gaccattgcc ctgttgctga tgctgctggt 19680
actgtggctg gtcaactggt cgatgagtat catcggcatc agtgtcctgt gcctggcgat 19740
tggcttcgtc acaccgctgc tggccctggg cgatgtccat ccgtttgcgc cgctgtgggt 19800
ccgcctgacc ctgattggtt tcatcctgct gatgtacgcg ctgtacggcc tgtaccggct 19860
gtggcgtgcg ctgcgtatgg atgaacaact gctgcgtcgc ttcctgcatc cgcgcgggga 19920
agaggtaccg gtggcaggcg agatcaaagc cgacctgcgc accgtcaacc atattgtcac 19980
gcaggccatc cggcagctgc ggcagttgcg ggtggatatg cctggctggc gtaaaatctt 20040
cgagggaaaa cgctttctgt atgagctgcc gtggttcatg gtggtcggca gtcccggcga 20100
cggcaaaacc acggccctgc tgaacaccgg attgcagttc ccgctggcgg agcaaatgga 20160
gcagacttcg cgcatcctga cagtaccggg tggcggcacg ctacactgcg actggtggtt 20220
taccaacgaa gcggtgttga ttgataccgc cggacgctac gcccgccacg atgacggtgg 20280



tgaagcgagc gccgcgcagc gtaacgccgg agagtggcag ggctttctcg gtctgctgcg 20340
taaacatcgc cccggcgcgc cgcttaacgg cgtgatcctg acgctaaacg tggcggattt 20400
aaccgcacag tcaccggcgg aacgcctggc ggcctgcgcc gctctgcggg cgcgactggc 20460
agaactgcgc gagaccctgg ggattcgctt tccggtctat ctggtggtca ccaaaatgga 20520 
tttgttgccg gggttcagcg aatattttcg cacgctgacc agccatcttc gtgcacaaat 20580 
ctggggcttc acgttgccgt acagccgcag gcgaaaagcg ggcgacccgc aggcgctgca 20640
cgccgcctgc gcgcaggagc tggcgcgcct gacgctgcgg ttggatcagg gactggatac 20700
ccggttacag gaagagtacg accttaaaag ccgccagcgg ctgtatacct tcccgcgtga 20760
gttcgccgcc ctcggcgagc cgttgctgga ggctattgaa cagatcttcc tcgattcaaa 20820
attcgatgcc acgcaactga ataacacgct gcgcggggtg tttttcacca gcgccgcgca 20880



ggcgcaggcc gacgccgtgg ccgaccagtt gagtatctgg cagcgctttg tccgggcgat 20940
aaaaaccgcc cgtggcgaat cctccgcctc tctcccacac gctctgccgg acggcaaccg 21000 
cagctacttc ctgcatgacc tgctgacaca gttt atttt t cgtgaagcgc acctggtgga 21060 
gccaaacctc cagtgggcct ggcgttaccg cctgctgcgc ctcggcgggc acctgctggt 21120 
actggtgctg gcattcctgt tgtggcaggg gatgcagacc agccagcaga ccaacggcga 21180 
ctatctgaat gaaatcagcg cccgcgcgac ccggctggac ggtgatgtga aagcctacac 21240 
cggtaaaccg gcgatggctc ccgtcccggc actgctggac agcgcaaggg aactgtccgc 21300
ctggccggaa ctggacccgg acgcgccgcc gctggcctgg cgctacggtc tgtacagcgt 21360
accgccggta accgacagcg tggcgtcgct gtacaaccgt ctgctggatc aactgctgct 21420 
gccgccgctg gtgaaacgga tggagtatgt gctggcggac gccattgccc gtcaggatag 21480 
taaagcggcc tacgatgccc tgcgcatcta tctgctactg aatctggata aagatcacga 21540 
agataaatac aacgcggcgg agatccagtc gtgggtgatt aacgatctgg ggaacagcga 21600 
cagcgtggcc gggttcggcg ggcgcgccgc cgtgctgacg catatcgaag cgctgtttga 21660 
cggcagccgg gtggtgcatt caccgtatga gaaagatgag gcgctgatcc gccaggcgcg 21720 
ggcattcctc gacggtcaca ccagtaccga gcgtatctac gcgcgggcgc tggcggcaat 21780 
ggagagcgaa gcgccgcagg agttcacgct ggtacgcgcc gtcggcgcgg atgcgggaac 21840 
ggtctttgtg cgtagcaacg gcgcgccgct ggatcggggc gtgccgggta tttttacccg 21900 
tgaaggatac cgggagctgt tcgacaaacg attaccggaa tttgtggcgg cggcgacggc 21960 
gaacgatggc tgggtgatgg gccgggagag tacgccaaaa aagctgactg acagcctgcg 22020 
cagccagata ccggggcagg agcagtctgt cgcccgcgaa gtccgccgtt tgtacctgac 22080 



ggaatatgcc cgccgctggc aggattttct ggacagtatc catagtatca acagtgccgg 22140
ggaagagggc agttccggcc tggcctatga tttacaggtg ctgcgcaccc tggcgtcgcc 22200
ggactcaccg ctgatgcggc tgggaaaagc ggtggtggag cagaccacgc tggtgccgcc 22260
gccggacccg caggccagac agaaacaact ggcgcagcgc gcatccggta acgcggggaa 22320
agtggtacag acggcaaaac tgttccagga tattcacccg gaagaacggc tggaaaaaac 22380
gctggtggat gatcgtttcg ccgcgctgcg cgaggtcatt gccgggcgga cggacggcgg 22440
acagagcggc ggtgggacga tgcagatcgc ttcgctgctg accatgctca acgagtatta 22500
cacccaactg accattgctg atagcgcgct ggcggcgggg acgttgccag cgcgcattac 22560
tgccgcagac aagttgcaac tggaggcggc gaaactgccc gcgccgctga aaaacatcct 22620
gctggatctg acgaagcagg gaacgcgcaa aatcaacgcc gggaccggcg acgtgctgaa 22680



cacccagatg gaggcgatga tgggcgacga ctgccgtgac gccatcgacg ggcgctatcc 22740
gttcgccgac agtccgcagg aggtcagcgc cgaggacttt aaccgcatct tcgccagcgg 22800
cggcgtactg gatgctttct ggagcaaaca actggctccg ctggcggaca ccgccagcga 22860 
cccgtggcgc tacaaaccga ccgaaggcaa catgacattg caggggccgg atctgacgcc 22920 
gtttcagcag gcgaagcaga tccgcagcgt attttttaac agcgagggcg ggaaaaaatt 22980 
ttcctggtcg atgcagatta gcgtggtgga tatggacccg gccatcacgg aactggtgat 23040
tgatatcgac ggccaggtgc tgcgctacgc ccatggtccg gaccgtcccc tgaaagtaac 23100
gtggccggga ccgcgcaacg gctcgatggc ggaaatcacc gccagcccgc gtattcgcca 23160 
ggacacctca acgctactga ccggcgggcc gtgggcgctg tttcatctgc tggacgccgg 23220 
aatggtacag gaaacggcgg tgcgggggcg tcaactggtg gaatatgatt ttgacggtcg 23280 
ccgggtggtg ctggaaatca ccgccgggcg ggattttaac ccggtcagcc gggagctgtt 23340 
gcagaacttc agttgcccgg cgagggcgct gtaatgcgcc ggccagcagt ggtatcagcc 23400 
ccggcgatgt tcggcaggct gccggaccag cgcgactatg tgcgctggcg agtgggcgct 23460 
gaagagggac gaatatggca ggactggctg aaccgccaga cgtgggtggg cagcgggcgg 23520
catatcgtgt tgccgggagg ccagacgacc gatgaagagc gtgacggctg gatgcacctg 23580
tcgccgcgtg gcgaacatga tatgcccgcg ccggaacacc atccgctgcc gtggagcttt 23640
gtgatgtcgc cgggattttt gccgatgggt gggacgactg actggctgac cggagtgctg 23700 
atggcgtcac gggatagcgt gggacgcccg tggccgctgg tgatttacca gcgctgcggt 23760 
cgggagtggc tggacgagag tttgcaggag acgcagggct ggctgtactg gctggcgagg 23820 
ctggcggcgc agcatatcac gccggacacg atgcggcgcg ggcggctgac cgagcaggtg 23880



gaccagctat gggcgatgtg gcagccgggg ccgtggtggg cgcagtggct gcgtgggtta 23940 
cgacgtacat cgcaacgcag ccgggaactg accgggctgc cggatgaagc accggtagta 24000 
gaactgccgg gcgtgcgtta tctgccgtgg ccgggctggc cgggtaagac gctggggcag 24060 
gcaacaccgg ggcaaggctg gttctggcag cagaacagtg aagggcggta tgtggatgcg 24120 
ctgcggttgg tggatgaaaa aaatagaggt cagatgctgt gaaaaagtac gctgtagagg 24180 
ttctgtttat gtctgcctgt gcaggaatgt tcctgcccgt atttgcctgg ggcggaacag 24240 
atgtaaacat tgataacccg ctggcggagt gtgtggatat ccatcccgtt catcgtcagg 24300 
aaatggataa ccttaccatt ctgaaaacaa cggtcacact gaaaaaaagc accggtgaat 24360 
gtggctgttt ctcaacatta atcaactata ccagcttact ggcgcaggat gttgagggtt 24420 
acggacgcgg aagtgcatat tcccttcagg aggggaatat ctcactggcg aagatgcagg 24480 



gacgttatcc tttcagcttt gtattatcgg tggataacca gtctgtacgg gatcagaagc 24540 
tggcgctaat gatacgttgt acaccaccgc tgtaatacac agaatagtca gggagaagat 24600 
gatggcagta agactgactt ttgacgggca aaagctgaca tggcctggta tcgggatatt 24660 
taaggcgacc acggggttac cggatttaca gtggccagat aaacagtgtg tgccggatgc 24720 
ggcgataccg gaagggaatt ataaattgtt tattcagttt cagggggagg caccgataag 24780 
aaatgctgcg gattgtgatc tgggaccatc atggggctgg agtaccattc cgcgaggcca 24840
ggctgccgga acatgtgaga tatactgggc gaactgggga tataatcgta tccggctgga 24900 
atcagcggat gagaagaccc gaaaagcctg tgggggcaag cggggtggtt tttatatcca 24960
tgattccacc aaaggttaca gtcatggttg tattgaagtg gaaccggtgt ttttccgtat 25020 
tctgaaacag gagacggaaa aagaaaatgg tgaaaagaca tttacggtta atgttaagta 25080 
tgtttctggt cagcaaacga atggtggaac aaaacaataa taccgttaaa accggatgaa 25140
attatcgtgt ctggacgtgc tgtggttgac aatgattaca atggtggcca gataagtaat 25200
gagcaccaat gataaagtat atgactggcg gtgtgctggt tattaatcaa atttctataa 25260
aatgcaatgc gaatctggta agcgtaataa aaataaaatc ataatattgt tatgttcatt 25320
tccttattta tgtaattcag tatttatgtt atgtgctaat ttttgtgttt ttattttcat 25380
ggctcttgtc agcaatatac cctgttcttc tggtaaatat ttaaatataa caggctggtt 25440
gcattataaa gtgcggggca ctgtttcctg acggtgagtc tatttatttt aatccggtat 25500
taaaggagtc actaccatga gttttgtatc cacaaataat aaatccggta tgggagggct 25560
gacgacaacc acgccgccga taaccggaga aagtggcggt gtcaccgcag attcagtcgc 25620
cggaagcgtg gcagatgcgg cggaatccgc cgtggaacag gctgcgggat cgctatttgg 25680



cgcattgccg gagccatcag gactggtgaa agccgcggta gcagcggcgc aggctgccgc 25740
cgccgcaggt atggcgcagg atgcggtatc ggccatcgtc tctgctgttg caggcgggcc 25800
gggggcgcat aatgtgacgg tcagcggcag cgccgtaccg ccgggcgcat tactgttcgc 25860 
cagcctggac ggcggcgaaa cattaagtga actgttcagc tatgtggtac agctaaaaac 25920 
gcccgacacc ctgaatctgg gctatgtctc cccggcggcc aacctgccgc tcaaaccgat 25980
ggtgggcaaa gatctgtgcg tcaacatcga actggatggt ggcggtaaac gacatatcag 26040
cgggctggtc acggcggcgc gggtggtggg ccatgaaggg cgttcggtta cctatgagct 26100 
gcgtatggag ccgtgggtaa aactgctgac ccataccagc gactacaaag cattccagaa 26160
taaaaccgtg gtggatattc tggatgaggt tctggcggaa tatccctacc cggtggaaaa 26220
gcggctggtg gaaagctacc cggtacgcac ctggcaggtg cagtacggtg aaactgattt 26280

gal: 1 1 tWlT^agcgaW 
ggacagccac acgctggtgc tggcggatgc catcagcgcc cacaaagcat gtccggactc 26400
gccgctggtc gagtggcacc aggaagggct gaagctggac aaggagttta tccacactat 26460 
cacggcaaac gagagcctgc ggactggaca gtgggtgctg gatgatttcg attttacgaa 26520 
gccacgttca ttgctggcaa acaccgtggc aaacccgcgt gaaaccggtc atgccaccta 26580
cgagcactat gagtggccgg gagactactt cgacaagagt gaaggcgaga tgctgacgcg 26640
cattcgtatg gaagcgcagc gcagccccgg cagtcgggtg ctggggggag ggaatatccg 26700 
cacactcatg accggttata ccttcacgct ggaaaactat cccaccgccg aagtcaatca 26760
ggaatatctg ctgatgcaga ccttgctgtt tgtgcaggac aacgcgcagc acagcgggca 26820 
ggaccagcac tttacctttt ccacccgttt tgaactgcac cccacccgcg aggtgttccg 26880
cccgcagcgg acggtgagca aaccccacac caaagggccg cagagcgcca tcgtcaccgg 26940
cccggcgggc caggaaatct ggacggatca gtacgggcgg gtaaaggtac agtttggttg 27000
ggatcgctac ggcaaaatgg atgaaaacag cacctgctgg atacgcgtca gctacccgtg 27060 
ggcgggcaaa ggcttcggga tgatccagat cccgcgtatc ggccaggaag tgctggtgga 27120 
tttcaaaaac ggcgatccgg atctgccgat catcgtgggg cgtacctaca accaggacac 27180
catgccgccg tggggactgc cgggaatggc gtcgcagagc gggatcttca gccactcgct 27240
gtatggcggg ccaacgaacg gcaacatgct gcgttttgac gacaaaacgg gcgcggagga 27300 
agtgaagttc cacgcggaaa aagatctcaa caccacggtg aagaataatg aaacgcatac 27360 
ggttatggtg gatcgcacta aaaccattat taaaaatgaa accaacagta ttggtgagga 27420 
cagaaacacc acggtaacga agaatgacgg cctttccgta aaactggcgc agacgatcaa 27480



tatcggcacc acttatcgtt tagatgttgg cgatcaattc acgcttcgct gcggcaatgc 27540 
ggcgcttgtt ttacataagg acggctccat tgagttttgt ggcaagcaac tgatgttaca 27600 
taccagcgat gtcatgcaac tgattggtaa aggtattgat atgaacccgg atggcggcac 27660 
agccgtaacc gccgatgata ttgcccccct tctcacctct gagtgatctg aattaaacct 27720 
ggagttctca tggatcgacc ataccgcata caggaagggt gttttgtcct gcctgaaaca 27780 
tttacggatc gcagcgtcaa tatttttatc ctggagggca atgaacgaac atcgcccagc 27840 
ctgaatattt cccgcgatac gctaaaacct gatgaagacc tgcccgccta tattgaccgc 27900 
cagattgcac tgatgaaaaa aaatctcggt cagcaccggg tattgtcgcg agcgcctgca 27960
caggcaggaa cgggcaatga tgcccttatg ggggaacaaa ttgccgccac ccataaatcc 28020
gggaaaacgg aagtgtacca gcgtcaggcc gggtttattg caacccctgg caaggtactg 28080 



gtcttcaccc tgaccagtcc ccgtcctttt gatgataaag cagacctact ctggaacacc 28140
tggctggcag gctttcagcc ggataaaaac gaataatcac acggaggtgt gaccatgtat 28200
gaagcagccc gtgtggatga tcctatctac cacaccagcg cgctcgccgg gtttcttatc 28260 
ggcgctatca tcggcatcgc cattatcgcg cttgccgcct ttgccttctt tagctgcggt 28320 
tttcttgccg ggctgattct gggttttatg gccgatcaaa tagcctccgg ggtattgcaa 28380
ctgggcgagg ccatcgggcg ctccatccac cacacggcag gaaaaatcct caccggttcg 28440
gagaatgtca gcaccaacag tcgcccggcg gcgcgcgcgg tactgagtac ggtgaaatgc 28500
gataaccata tcgcagaaaa acgcatcgcc caagggtcgg aaaatatcta catcaacagc 28560 
cagcccgccg cccgtaagga tgaccacacc gaatgcgacg cggtgattga agacggttcg 28620 
ccgaatgtgt ttctcggcgg cggcacacag acggtactgg aaatcagttc tgaaattccg 28680
gactggctgc gcaaggtggt ggatgtattg tttgtcgtgg cgagtctgct cggcgggctg 28740
gccggggcgt ggcggcaggc ggcaaagctg gggacgaaat ttggcactaa atgtgccgct 28800
aagtttatcg gcggggagct tgtcgggatg gccgtgggtg aggctatcag cgggctgttc 28860
agcaatccgg tggatgtgac caccgggcag aaaatcctgc tgccggaaac ggacttcacc 28920
ctgcccggtc gcctgccggt cacctgctcg cgtttttacg ccagccacct ggaaactgtg 28980
ggactgttgg gacggggctg gcggctgaac tgggaaacca gcctgcgcga tgacgatgaa 29040
cacatcacgc tgaccggcgt acaggggcgg gaactgcgtt acccgaaaac gatgctgacg 29100
cccggccacc agatatttga cccggaagaa cagttatacc tcagccgcct gcatgacggg 29160
cgttacgtgc tgcattacac cgatcgcagc tattacgtat ttggtgattt tgacagtgac 29220
ggcatggcat acctgctgtt tatggagacg ccgcaccgcc agcgcattgt cttcgggcac 29280
gaaggaggca gactggtacg gatagcctcc agcagcgggc atcacctgtt actgcaccgc 29340
acacagaccc cggcagggga gcggctgtcg cgaattgaac tggtgcaggg cggcacccgt 29400
ggcaatctgg tggagtaccg gtatgacgat aacggtcaac tgaccggcgt ggtgaaccgg 29460
gcgggaacgc aggtgcgtca gtttgcttat gaaaacgggc tgatgacggc gcacagcaat 29520
gcgacggggt tcacctgccg ctaccgctgg caggaactcg acggcgcgcc gcgcgtgacg 29580
gagcacgaca ccagtgacgg cgaacattac cgctttgact atgattttgc cgcaggcacc 29640
accaccgtca ccggcaggca gggggagaca tggcagtggt ggtacgacag ggaaacgtat 29700
atcaccgcgc accggacgcc gggcggtgga atgtaccgct tcacgtacaa cgaagaccac 29760
ttccctgtca acattgagct gcccggcggt cgcacggtgg cgtatgaata tgacatccag 29820
aaccgggtgg tgaagacgac agatccggaa ggccgggtga cgcagacgca gtggaacggc 29880
gagttcgacg aaatcacgcg cacggcgctg gacgatgacg ctgtctggaa aacgcagtac 29940
aacgcccacg gccagccagt gcaggagacg gacccggaag ggcgggtgac gcagtacgct 30000
tacgatgaac aggggcagat gtgcagccgg acggatgcgg cgggcggcac gcagggcggc 30060
gtgcggcggg agacgcagca gcgggatgcg ctgggccgtc tgttacggac ggagaatgaa 30120
cacggccagc ggacgttcag ctacaaccgg ctggaccaga taacggcagt gacgctcacg 30180
cccacggagg cggggcaaca gcagcaccgg atgcaggccg acacggtgcg ttttgagtat 30240
gaccgcagcg gctggctgac ggcggagcac gcggggaacg gtagcatatg ttatcagcgc 30300
gacgcgctgg gcaacccgac ggacatcacg ctgccggacg ggcagcacct gacgcatctg 30360
tattacggga gcgggcatct gttacagacg gcgctggacg gcctgacggt gagcgagtat 30420
gagcgcgaca gcctgcaccg tcagataatg cgcacgcagg ggcagcttgc gacgtacagc 30480
ggctatgacg acgacgggct gctgagctgg cagcgcagtc tggcgtccgg cagtgcccct 30540
gttcttcctg gccagcgccc ggcgcggcag ggctgcgtga cgtcgaggga ctattactgg 30600
aacaaccacg gcgaggtggg cacgattgac gacggcctgc gtggcagcgt ggtgtacagc 30660
tatgacagaa gcggttacct gaccgggcgc tcaggtcaga tgtatgacca tgaccgttat 30720
tattacgata aggcgggcaa cctgctggat aacgaagggc agggagcggt gatgagcaac 30780
cggctgccgg gctgtggtcg tgaccgttac ggctataacg agtggggcga gctgaccacg 30840
cggcgcgacc agcaactgga gtggaacgcg caggggcagc tgacgcgggt catcagcggc 30900
aacacggaga cgcactacgg ctacgatgcg ctggggaggc gaacccgcaa ggcgacgtac 30960
gggcggcaca cgggccatac ggcgcggagc cggacggact ttgtgtggga ggggttcagg 31020
ctgttgcagg agaacgtgca gcagcagggc tggcggacct atctgtacga tgcggaacag 31080



ccgtacacgc cggtggcgag cgtgacgggg cggggagaaa gcaggcaggt gtggtattac 31140 
cacacggatg tgacgggcac gccgcaggag gtgacggcgg cggacggaac gctggtgtgg 31200 
gcggggtata tcagggggtt tggagagaat gcggcggaca tcagcaacag cggggcgtac 31260 
tttcaccagc cgctgcggct gccggggcag tattttgacg acgagacagg gctgcattac 31320 
aatctgttca gatattatgc accggagtgt ggacggtttg tcagtcagga tccgatcggg 31380 
ctgaggggcg ggttaaacct ttatcagtat gcgccaaatc ctctcaaata tatagaccca 31440 
cttggtttaa ccgcgactgt tgggcgatgg atggggcctg cggaatatca gcaaatgctt 31500 
gatactggga cagtagtaca aagttcaaca gggacaactc atgttgccta ccctgctgat 31560 
atagatgctt ttggtaagca agcaaaaaat ggtgctatgt atgttgaatt tgatgtgcct 31620 
gaaaaatcat tagtacctac aaatgaagga tgggcaaaaa tagtagggcc agattctatc 31680 



gaagggcgat tagctaaacg caaaggtttg cctgttcctg aaatgccaac agcagaaaac 31740
ataactgtaa ggggcgagaa aattaatggg gaagttgaag caaaatgcta aataaattta 31800 
aattgtgggt gagcaaacat actgattata cggtaattca taatgaaaat gatttatctt 31860 
acagtattat tatagatttt gaagatgacc ggtatatatc aagatttact gtatgggatg 31920 
acctaagctg tatgtcagaa gtaatggatg tggatactgg tttatataaa ttaaacaaga 31980 
gaaacgaatt ttctacattt gatgaacttc tggatatatt tgatgatttt atgataagta 32040 
ttaaataata gttggccggg taagaagtta actcttcccg gctgttttat tatctaaccc 32100
ccatcaatcc ggagacgcgc taccggtacg atgcgctggg caggcgggtg agcaaggcga 32160
cgtacgggcg gcacacgggc catacggcgc ggagccggac ggactttgtg tgggaggggt 32220
tcaggctgtt gcaggagaac gtgcagcagc agggctggcg gacatatctg tacgatgcgg 32280
aacagccgta cacgccggtg gcgagcgtga cgggaaaggg agaaagcagg caggtgtggt 32340
attaccacac ggacgtgacg ggcacgccgc aggaggtgac ggcggcggac ggaacgctgg 32400 
tgtgggcggg gtatatcagg gggtttggag aaaatgcggc ggacatcagc aacagcgggg 32460 
cgtactttca ccagccgctg cggctgccgg ggcagtattt tgacgacgag acagggctgc 32520
attacaatct gttcagatat tatgcaccgg agtgtggacg gttcgtcagt caggatccga 32580
ttgggctggc ggggggggct gaatctttac cagtatgcgc ctaatccgat tagatggatc 32640
gatcctttag gacttgctat cctggagcat caatctaatt ttgatgcggc aaggagaacc 32700 
ggatttgaaa atgcgggtat gacaaaccct gaggatgtca ctttctcgaa agtcgatccc 32760 
aaaactggta ctgttgttga gtttaaaggt ccaaatgggg ctaaagttgc ttatgatgca 32820 
cctcatgcag atatggatgt gacagcaggg catgataaac cacatgttgg ttggcaatcc 32880



gcaggaaaaa gaggttccgg aggagctaat agaggtaata ttacttatga tggcccacaa 32940
catccgcatc gctctgactc taagggagat gataaatgtt aaattcaaat atgtctgaac 33000
ttagaatcga actggagaat gcgattaaaa atctcggtat tcatgattat cgtgtcgata 33060
aacccgaaca aatcgtttct gagataaaag agatatatgt taatggtaat cctagaacct 33120
ggtggttatc attaaaacat agacaatatg tcttttctta taccgataat tctggatata 33180
aaaacatatc acaaatagta agtaaacaac tcaatgaaag caatgtaatc aacaaacata 33240
tatttttgat tgctgatgaa gataatgagc aaatatatgt atataacgtt cctcttaact 33300
ccctgcctga aattatagaa aattgcagat attttgaata ttatgttgca gatcatgaac 33360
tatcttggct tatatgtgaa aatgatcatg gtgatttgat tgtatgctca accattaagt 33420
aaagcgcgag tgctctttag cgatatagtt gcccatattt aggcgttact agccgaagat 33480



ggcgcgattg tctggcaggg gaaacagcaa ttctgaggtc aggaagatag cataacccat 33540
taaccgggat agatccgcta gacctgaatc cagttgatgc gacaggttat agggtttatg 33600 
gttatttgct cctggagcaa ataaacctta ttacattggt attactaatg atatggtttg 33660 
acgaagggcc gagcattaaa gcactggcag gttatcaaaa gaaaatggaa ggatgctgcc 33720 
atttgatgaa aatgtaatct aatggaaagt cagaggttac gagaaatatt atatagagaa 33780 
atataaaacc agaaccggaa ccataggtga aaaaattccc tcaacaaata gagagaataa 33840
atataattca tttgatcatg ggcgaacaga tcccagcgca caagcattta aagactctta 33900
aaatagtaag ggagttggtt ccggtggagg aaaatgcgga tgagtgatta agaattttgg 33960 
ggctgtgata agaagtcgag aacaatgctg cgttttgtga agcccggaga catattttgt 34020 
tttaaattag atgaagatag atattgtttt gggcgaatta taacactaat gactgtcgga 34080
catctttctg aattatttga tataattaaa aaaccccctg gaataacaga gttagaaatt 34140 
agtaatgcaa ggcgaattat tgaaccaatt atagtggata catattcttt atttgataag 34200 
aaattagaaa atggaagtga ctggagaatt attggtcatc aggttaatta caatccaaaa 34260 
aatttagatg gtatctattt tgcacttgga ataggtgatt cctgtaaaaa gaaagactgt 34320 
tacggaaatg attttctcat ttcagaaagt gagtggaaaa cacttcctaa attatctcct 34380 
aaagggggtt ttgatatcaa aaaacggctt gaaattgcct gaaaatgaaa ataaaaagcc 34440
gggaaagatc ttttgtcttc ccggatttta ttatttaatc cccgttcacc acattattta 34500 
cccccgcctt aatatgcttc atcgactttt tcacctgata aagctccttc cgtagatccc 34560 
tcacttcgtc cgtctctgca atcaggatca aacacccctc ggagatcttc acggtgacgc 34620
cgtgcccggt ctcaaatccc gcttcttcct gccagtcacc cttaaggtgc tggctgggga 34680 



tttgtgagta acaggcggtc atgcaggttt cgctgttgat atggcggacg ctgacgccgg 34740
aggcattcat atttgctgac taaataaatt cttatttatc cgccggatgc tggctgattg 34800
tggagctcag ggtgagtgag tatgggcgcg acatcctggc accgcgcctc cctctccccg 34860
gccagccccg gccggtgatg agcagccggc tgccggggcc gtattttgac gatgaaacgg 34920
gcctgcatta aaatctgttc agatattatg taccggagtg t^gctggttc gtcagtcagg 34980
atccaatagg gctgaaaggg ggatggaacc gatatcattc tccgctgaat cctattacag 35040
atagtgatcc tcttggcctt attacttgtg gtgctgatag aggtgattct ggcaagttat 35100
taagatgaaa aatggttgaa aataattgcg ttagtgaatg ctcatttatg cctttaccta 35160
caaaaagtaa cggatttgct tgctggaatt gtgttaacga atgcaaataa aatcaatgct 35220
gttgatgttt ttactacagg aataagtatg gttaacgata aggatacagc tatattaatt 35280
iTgtaat^aa^tg^tgagatt cggtaaggag cttgatgaat ctgttgctgt tgttcagtcc 35340
cgttgtgatg aggatgaatt taatgtatat cgagaaacgg ttggttttat catgggtgaa 35400
atgcttatta aaataatgaa tccattatat gaaaaacatc cagaaataaa accaaaagga 35460
ttgaaacaaa acatctggaa ccggatgaat aatgtgtaaa agccggaggg gttatctttt 35520
cccggctttt tattatcaat tactcattaa ctcctgttcc gttcttttgc gtttaatcac 35580
cggaatatct ccggtattgt tcagcgcccc ggaaatgttt ttaaccactg ttctgcactc 35640
cgtttattaa tgcgggttac gcccatccct tcaatacagc caaagagtcc gtgggtatgc 35700
tgcggcgtga tcacgatgca atccctcatt acccgcacct tgacgggcat cccgttaata 35760
aaccccgcct gcggcatcca ttcaccgtac aaacgaactg aaagtctctc aacgcgtgag 35820
tatgtaagta tcccgcataa tcgagccatt cacatttaga gatcatccga cataatcaat 35880
ctgccaacgc aggagatcgc tatgcgtaaa gcccgtatta ctgcgcacca gatcatcgct 35940
gtgattagat cagtcgaatc cggacggact gttaaagatg tctaccggga gcccggtatt 36000
tctgaagcca ccagggacaa ctggaggtct ggatacggcg gcagggatac gcgtggaatc 36060 
acaaaaggct ccaccgtatt tactgtctgc tcaagctgaa ttttcgccgt aagggtaaac 36120
aacggctgcc ggtacgcaat ccctcgccac tggtcacgcc ggaagcgctg aaccagagct 36180
ggtctgtggg cgtcgttttc gcacgttcaa tgttgttgat gactgtaatc gtgaagcgtt 36240
gtcgattgaa atcgatctga atctgccagc tctgcgagtg gtccgtgtac tcgacaggat 36300
tacagcaacc gcggttatct ggccatgctg cgtatggata agggaccgga atttatctcg 36360
ctggcactgg ctgaatgggc aaagaaacat gcagtaaagc tggcgtttat ccagccgggt 36420
aagccgaaga aaaacgtttt catcacgcgc tttaaccgga cataccgtac agaaatactc 36480



aattcttatc tgttcagaac gctgaatgag gtgtgggaaa ttacggataa agggttatca 36540
gaatataact gcgaacgtcc acatgaatcg cggaacaata tgataccgaa ggaataccgc 36600
caataacgtt atctggccgg aatcttaaaa atgcatggaa ctaaaacggg tctatttaca 36660
ggggcacctg cgatgaattt cgctgcactg aaaagcgata ccggatgaga gctgcttcaa 36720
attaatgtgc catgttcacg gggaggttgt gcgacgtttg cataatccag caagaactga 36780
aaggaagggg agagcttttt catgcctgta taatcagtct ggcctgtgtc agtcagctct 36840
tagtgttgag actctcgttg gagcgttata attgcttttc tgtttcggaa aacaagattt 36900
tccattaaag atcttccctg cgaggaaaag ttaactaata atcttaccgt cgagttagga 36960
gatgtatgtt taaatataaa caatgttgca acgatgcctg ataattatcc tctcttcgaa 37020
gataagtttc ccacacccag tgtagtaggt gtcatggtaa tgttatcact tgaatgtaaa 37080



tggaaggtat aattgctttt tgactggcat tctattccac cctgacaaca cgatgttaac 37140
atcaacactg tttatattgg caataacgca atttttttca gattaagagg tgctctgata 37200
tatagatttt tatgacatta cttatttgaa ttggtaacaa ataccaataa gtacaagctg 37260
ttattaccag ccacggattt tttacatacg gtaagatttg gtatggcgtt atgtattctg 37320
gatgtgctgg attattttaa tttggtttaa aaaaggtggt tattcaa atg aaa agc 37376
Met Lys Ser
1

ata aaa aaa ttg att atc gca agt gcg ttg agc atg atg gct gct agt 37424
Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met Ala Ala Ser
5 10 15

tgt tat gct ggc tca ttt ttg ccg aac tca gag caa caa aaa tca gtg 37472
Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln Lys Ser Val
20 25 30 35

gat att gtg ttt tcc tct ccc caa gat tta acc gta tcg ctt att cca 37520
Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser Leu Ile Pro
40 45 50

gtg tcg ggc tta aag gct ggg aaa aat gct cct agc gcg aaa att gcg 37568
Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala Lys Ile Ala
55 60 65

aag ctt gta gtt aat tct act act ctt aaa gaa ttc ggg gtc agg ggg 37616
Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly Val Arg Gly
70 75 80

att tct aac aac gtg gta gac agt act ggc act gca tgg cgt gta gct 37664
Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp Arg Val Ala
85 90 95



ggt aaa aat act ggt aaa gag atc ggt gtg ggc tta tca agt gac agt 37712
Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser Ser Asp Ser
100 105 110 115

ctt aga aga tct gat agc acg gaa aaa tgg aat ggg gtg aac tgg atg 37760
Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val Asn Trp Met
120 125 130

acc ttt aat agc aat gac aca ctt gat att gtc ctg aca gga ccg gcg 37808
Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr Gly Pro Ala
135 140 145

cag aat gtc aca gct gac acg tac cca ata act tta gac gta gtg gga 37856
Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp Val Val Gly
150 155 160

tat caa cct taa tagtaaacaa ctattagtgt attgtgcctt gtttaaggcg 37908
Tyr Gln Pro
165

caatacacat caaatcatct atttttcttt tacaattttt gat atg aaa ata gtt 37963
Met Lys Ile Val
170



aat ttt gct gta atg gcg gta gct ttg ttc gcc act aat tct atg gtt 38011
Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr Asn Ser Met Val
175 180 185

tca gta tat gcc gtc aac cag caa tta aat tca gcc act aaa tta ttc 38059
Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala Thr Lys Leu Phe
190 195 200

agc gtg aag ctg ggg gct aca cga gtg att tat cac gct ggt acg gct 38107
Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His Ala Gly Thr Ala
205 210 215

gga gcg acg ctc tcg gtg agc aac ccg cag aat tac cct att ttg gtt 38155
Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr Pro Ile Leu Val
220 225 230 235

cag tct tca gtc aaa gca gca gac aaa agt tcg cct gct ccc ttt ttg 38203
Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro Ala Pro Phe Leu
240 245 250



gtg atg ccg cct cta ttt cgt tta gaa gca aac cag cag agt caa ctg 38251
Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln Gln Ser Gln Leu
255 260 265

cgt att gtc cgt act ggt ggt gac atg cca acg gat cgt gag act tta 38299
Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp Arg Glu Thr Leu
270 275 280

cag tgg gtc tgt ata aag gcg gta cca ccc gaa aat gaa ccg tcg gat 38347
Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn Glu Pro Ser Asp
285 290 295

aca cag gct aag ggc gcg acc ctt gac ctc aat ttg tcc atc aac gcc 38395
Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu Ser Ile Asn Ala
300 305 310 315

tgt gat aag ctg att ttc cgc ccg gat gcc gtg aag ggg acg ccg gaa 38443
Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys Gly Thr Pro Glu
320 325 330



gat gtt gca gga aat tta aga tgg gtg gag acg ggc aac aaa ctt aag 38491
Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly Asn Lys Leu Lys
335 340 345

gtg gag aac ccc acc ccg ttt tac atg aat tta gcc tct gtc aca gta 38539
Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala Ser Val Thr Val
350 355 360

ggg gga aag ccc att aca ggg ctt gag tat gtc ccc ccc ttt gct gac 38587
Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro Pro Phe Ala Asp
365 370 375

aaa aca cta aat atg cca ggt agt gcc cat ggt gat atc gag tgg aga 38635
Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp Ile Glu Trp Arg
380 385 390 395

gtt att aca gac ttt ggt ggt gaa agt cat ccg ttc cac tac gtt ctt 38683
Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe His Tyr Val Leu
400 405 410



aaa taa atccaggggc ttagcggcag aaa atg aag ttc aaa caa cct gcc ttg 38736
Lys Met Lys Phe Lys Gln Pro Ala Leu
415 420

cta ctg ttc atc gcg gga gtg gtt cat tgc gca aat gcg cac act tac 38784
Leu Leu Phe Ile Ala Gly Val Val His Cys Ala Asn Ala His Thr Tyr
425 430 435

aca ttc gat gca tca atg ttg ggc gat gca gcg aaa ggg gtt gat atg 38832
Thr Phe Asp Ala Ser Met Leu Gly Asp Ala Ala Lys Gly Val Asp Met
440 445 450

tcg ctc ttt aac cag ggg tta caa cag cca ggg act tat cgc gtg gac 38880
Ser Leu Phe Asn Gln Gly Leu Gln Gln Pro Gly Thr Tyr Arg Val Asp
455 460 465

gtg atg gtg aat ggt aaa cgt gtc gac acc cgt gat gtg gtg ttc aaa 38928 
Val Met Val Asn Gly Lys Arg Val Asp Thr Arg Asp Val Val Phe Lys 
470 475 480 485 

ttg gaa aag gat ggg caa gga acg cct gtt ctg gct cct tgt ttg acg 38976
Leu Glu Lys Asp Gly Gln Gly Thr Pro Val Leu Ala Pro Cys Leu Thr
490 495 500

gtc agt cag ctt tca cgc tac ggc gta aaa acg gaa gat tac cct cag 39024
Val Ser Gln Leu Ser Arg Tyr Gly Val Lys Thr Glu Asp Tyr Pro Gln
505 510 515

ttg tgg aaa gca gca aag ccc cca gat gag tgt gcg gat ctg acc gcc 39072
Leu Trp Lys Ala Ala Lys Pro Pro Asp Glu Cys Ala Asp Leu Thr Ala
520 525 530

att cca cag gct aaa gcg gta ctg gat atc aat aat cag caa ctg caa 39120
Ile Pro Gln Ala Lys Ala Val Leu Asp Ile Asn Asn Gln Gln Leu Gln
535 540 545

ctg agt att ccg cag ttg gcg ttg cgt ccg gaa ttt aag ggg atc gct 39168
Leu Ser Ile Pro Gln Leu Ala Leu Arg Pro Glu Phe Lys Gly Ile Ala
550 555 560 565

cca gaa gat ctt tgg gat gat ggt att ccg gcg ttt ctg atg aac tac 39216
Pro Glu Asp Leu Trp Asp Asp Gly Ile Pro Ala Phe Leu Met Asn Tyr
570 575 580



agt gcg agg aca acg cag acg gat tac aaa atg gat atg gtg ggg cgt 39264
Ser Ala Arg Thr Thr Gln Thr Asp Tyr Lys Met Asp Met Val Gly Arg
585 590 595

gac aac tct tcc tgg gta caa ctg caa ccg gga atc aat ata ggt gcg 39312
Asp Asn Ser Ser Trp Val Gln Leu Gln Pro Gly Ile Asn Ile Gly Ala
600 605 610

tgg cgt gtc cgc aat gcg acc agc tgg cag cgg agt agt caa ctg tcg 39360
Trp Arg Val Arg Asn Ala Thr Ser Trp Gln Arg Ser Ser Gln Leu Ser
615 620 625

ggg aag tgg cag gca gca tat acc tat gct gag cgt gga ctg tac tca 39408
Gly Lys Trp Gln Ala Ala Tyr Thr Tyr Ala Glu Arg Gly Leu Tyr Ser
630 635 640 645



cta aaa agt cgt ctg act ctg ggg caa aag act tcg cag ggg gag ata 39456
Leu Lys Ser Arg Leu Thr Leu Gly Gln Lys Thr Ser Gln Gly Glu Ile
650 655 660

ttt gat agt gtg cca ttt acc ggt gtg atg ttg gca tcg gat gac aac 39504
Phe Asp Ser Val Pro Phe Thr Gly Val Met Leu Ala Ser Asp Asp Asn
665 670 675

atg gtg ccc tac agt gag cgg cag ttt gct ccg gta gtg cgt ggg att 39552
Met Val Pro Tyr Ser Glu Arg Gln Phe Ala Pro Val Val Arg Gly Ile
680 685 690

gcc cgc acg cag gct cgg gtg gag gtc aaa cag aat ggt tac acc att 39600
Ala Arg Thr Gln Ala Arg Val Glu Val Lys Gln Asn Gly Tyr Thr Ile
695 700 705

tac aac acc act gtg gcg ccc gga ccg ttt gca ctg cgg gat ctg tcg 39648
Tyr Asn Thr Thr Val Ala Pro Gly Pro Phe Ala Leu Arg Asp Leu Ser
710 715 720 725

gta aca gac agt agt ggt gat ctg cat gtc acc gtg tgg gag gcc gat 39696
Val Thr Asp Ser Ser Gly Asp Leu His Val Thr Val Trp Glu Ala Asp
730 735 740

ggc agt aca caa atg ttt gtg gtg ccg tat cag acc ccg gcg ata gca 39744
Gly Ser Thr Gln Met Phe Val Val Pro Tyr Gln Thr Pro Ala Ile Ala
745 750 755

ctg cac cag gga tat ttg aag tac agc ctg ttg gcg ggc cga tac cga 39792
Leu His Gln Gly Tyr Leu Lys Tyr Ser Leu Leu Ala Gly Arg Tyr Arg
760 765 770

tcg tca gac tct gca acg gat aag cgg cag atc gcg cag gct acg ttg 39840
Ser Ser Asp Ser Ala Thr Asp Lys Arg Gln Ile Ala Gln Ala Thr Leu
775 780 785

atg tat ggt ctg ccg tgg aat ctc act gca tac ggc ggt ata cag agt 39888
Met Tyr Gly Leu Pro Trp Asn Leu Thr Ala Tyr Gly Gly Ile Gln Ser
790 795 800 805

gca acg cat aat caa gct gca ttg ctt ggt ttg ggg gga tct ctc ggg 39936
Ala Thr His Asn Gln Ala Ala Leu Leu Gly Leu Gly Gly Ser Leu Gly
810 815 820

cgg tgg ggg agt tta tct gtc gat gga agc gac aca cac agt cag cgt 39984
Arg Trp Gly Ser Leu Ser Val Asp Gly Ser Asp Thr His Ser Gln Arg
825 830 835



cag ggg gag gcg gta cag caa gga gcc tcc tgg cga ctg cgt tac agc 40032
Gln Gly Glu Ala Val Gln Gln Gly Ala Ser Trp Arg Leu Arg Tyr Ser
840 845 850

aac cag ctg act gcg acg ggg aca aat ttt ttt ctg acg aga tgg cag 40080
Asn Gln Leu Thr Ala Thr Gly Thr Asn Phe Phe Leu Thr Arg Trp Gln
855 860 865

tat gcc tcg cag ggc tat aac acc cta tcc gat gtg ctc gac agt tat 40128
Tyr Ala Ser Gln Gly Tyr Asn Thr Leu Ser Asp Val Leu Asp Ser Tyr
870 875 880 885

cga cat aat ggc aac cgt cta tgg tcg tgg cgg gaa aat ttg cag ccg 40176
Arg His Asn Gly Asn Arg Leu Trp Ser Trp Arg Glu Asn Leu Gln Pro
890 895 900

agc tcg cgt act acc ctg atg ttg agt cag tca tgg ggg agg cat ttg 40224
Ser Ser Arg Thr Thr Leu Met Leu Ser Gln Ser Trp Gly Arg His Leu
905 910 915

ggc aat ctg agt tta acc ggt tcc cgt acc gac tgg cgt aat cgc ccc 40272
Gly Asn Leu Ser Leu Thr Gly Ser Arg Thr Asp Trp Arg Asn Arg Pro
920 925 930

ggt cat gat gac agc tac gga ctg agt tgg gga acc tct atc gga ggg 40320
Gly His Asp Asp Ser Tyr Gly Leu Ser Trp Gly Thr Ser Ile Gly Gly
935 940 945

ggc tcg ctg tca ttg aac tgg aat caa aac aga acg ctg tgg cgc aat 40368
Gly Ser Leu Ser Leu Asn Trp Asn Gln Asn Arg Thr Leu Trp Arg Asn
950 955 960 965

ggc gcg cac cgt aaa gag aac ata acc agc ctg tgg ttc agt atg cca 40416
Gly Ala His Arg Lys Glu Asn Ile Thr Ser Leu Trp Phe Ser Met Pro
970 975 980

tta agc cgc tgg acg ggg aat aat gta agt gct agt tgg cag atg act 40464
Leu Ser Arg Trp Thr Gly Asn Asn Val Ser Ala Ser Trp Gln Met Thr
985 990 995

tca cca tca cac ggt ggt cag acg caa caa gtg ggg gtc aac gga gag 40512
Ser Pro Ser His Gly Gly Gln Thr Gln Gln Val Gly Val Asn Gly Glu
1000 1005 1010



gca ttc agt cag caa ctg gat tgg gag gtg cgt cag agt tac cgt gcc 40560
Ala Phe Ser Gln Gln Leu Asp Trp Glu Val Arg Gln Ser Tyr Arg Ala
1015 1020 1025

gat gcc ccg cca ggt ggt ggt aat aac agc gca ttg cac ttg gca tgg 40608
Asp Ala Pro Pro Gly Gly Gly Asn Asn Ser Ala Leu His Leu Ala Trp
1030 1035 1040 1045

aat ggg gat tac ggc ctg tta ggt ggt gac tat agc tac agc cgg gcg 40656
Asn Gly Asp Tyr Gly Leu Leu Gly Gly Asp Tyr Ser Tyr Ser Arg Ala
1050 1055 1060

atg cgc cag atg gga gtc aat atc gcg gga ggt ata gtt atc cac cat 40704
Met Arg Gln Met Gly Val Asn Ile Ala Gly Gly Ile Val Ile His His
1065 1070 1075

cat ggt gtg acg ctg ggg caa cct ttg caa ggc tca gtg gcg ctg gtt 40752
His Gly Val Thr Leu Gly Gln Pro Leu Gln Gly Ser Val Ala Leu Val
1080 1085 1090

gaa gcg cca ggg gcc tcg ggg gtg cca gtt ggc ggc tgg cct ggc gtt 40800
Glu Ala Pro Gly Ala Ser Gly Val Pro Val Gly Gly Trp Pro Gly Val
1095 1100 1105

aag acg gat ttt cgt ggc gac acc aca gtg ggc aac ctg aac gtc tat 40848
Lys Thr Asp Phe Arg Gly Asp Thr Thr Val Gly Asn Leu Asn Val Tyr
1110 1115 1120 1125

cag gag aat aca gtc agc ctc gat ccg tcg cga cta ccg gat gac gca 40896
Gln Glu Asn Thr Val Ser Leu Asp Pro Ser Arg Leu Pro Asp Asp Ala
1130 1135 1140

gag gtc aca caa acc gat gtg cgc gtg gtg cca acc gaa ggg gcg gtg 40944
Glu Val Thr Gln Thr Asp Val Arg Val Val Pro Thr Glu Gly Ala Val
1145 1150 1155

gtg gaa gcg aag ttt cac act cgc atc ggg gcc agg gca ctg atg acg 40992
Val Glu Ala Lys Phe His Thr Arg Ile Gly Ala Arg Ala Leu Met Thr
1160 1165 1170

ctg aaa cgg gaa gat ggt agc gcc att cct ttc ggg gcg cag gtt aca 41040
Leu Lys Arg Glu Asp Gly Ser Ala Ile Pro Phe Gly Ala Gln Val Thr
1175 1180 1185

gtc aat ggg cag gat ggc agt gct gct ctg gtg gat act gat agc cag 41088
Val Asn Gly Gln Asp Gly Ser Ala Ala Leu Val Asp Thr Asp Ser Gln
1190 1195 1200 1205

gtt tat ctc act ggt ttg gcg gat aag ggc gaa ctg acg gtg aaa tgg 41136
Val Tyr Leu Thr Gly Leu Ala Asp Lys Gly Glu Leu Thr Val Lys Trp
1210 1215 1220

gga gca cag caa tgt cgg gtt aac tac cgc cta cct gcc cac aag gga 41184
Gly Ala Gln Gln Cys Arg Val Asn Tyr Arg Leu Pro Ala His Lys Gly
1225 1230 1235

atc gcg ggc ttg tat caa atg agc ggt ctc tgc aga tag ccgattctga 41233
Ile Ala Gly Leu Tyr Gln Met Ser Gly Leu Cys Arg
1240 1245 1250

aggagagaat a atg tgg atg aaa ata cag cga gtg aaa acg gtt atc tat 41283
Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr
1255 1260

agc gta agc tta ctg gtc gct gcc agt agc ttg gtg ccg ata gcg aac 41331
Ser Val Ser Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn
1265 1270 1275

gcc gca gaa aaa ctt cag aca acg cta cgt gta ggt act tac ttt cgg 41379
Ala Ala Glu Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg
1280 1285 1290 1295

gct ggg cac gtg cca gat ggg atg gtg ctt gcg caa ggc tgg gtg act 41427
Ala Gly His Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr
1300 1305 1310

tat cac ggc agt cac agc ggg ttt cgg gta tgg agc gat gag caa aag 41475
Tyr His Gly Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys
1315 1320 1325

gcg ggt aac acg cct acc gta ttg ctg ctg agc ggg caa cag gat cct 41523
Ala Gly Asn Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro
1330 1335 1340

cgc cat cac att cag gtt cgc ctg gag ggc gag ggg tgg caa cca gat 41571
Arg His His Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp
1345 1350 1355

acg gtg agt ggt cgt ggc gcc att tta aga acc gct gca gat aac gcc 41619
Thr Val Ser Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala
1360 1365 1370 1375

agt ttc agt gtg gtc gtt gat ggc aat cag gaa gtg cct gcg gac acc 41667
Ser Phe Ser Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr
1380 1385 1390

tgg acg ctg gat ttt aag gcc tgt gca ttg gcg cag gag gat acg tag 41715
Trp Thr Leu Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
1395 1400 1405

ccgtctgttc cactcatact tcctgtatca gtgaataagg cgaatactgt atatcaggtt 41775
cgaggatgta ttgaaaattt actcaaggca aatatggtta gcaatcttca ttgcagaaaa 41835
atcaattgct gattgatttt aaaatctgaa attcatcttt tttgtaggga gaggaatatt 41895
atgtttcgta aaataattta acggacactg atgcctttat tatttctgtt ttgtggattt 41955
gtttgattca ctcttttgtg agcggggctt tataagcccc tggtataagg tttgtttatc 42015
tgtcaggtat tcatatgtga tatttaagat tttatgttta gggggctttt tcttgtcccc 42075
tcaggcgtaa aaataattta ttttttacat aaaggaataa agcatatgtc ttatgcacga 42135
catttaccgg tattaatgta tcaccatgtc agtgataaac ccggacagat aaccttatct 42195
ccccgtacgt tccgggcgca gatgaaatgg ctggccgaat ctggctggaa aaccgttact 42255
gcctgcagag gtggaagctt tttatcatgg tgcaagattg cctcgtaaaa gcgtcatgct 42315
gacctttgat ggcggctggc tggataactg gttgcaggtt tttccggtgc tgcaggagtt 42375
taatctgcat gcgcatctct ttcttgtgac cagtttgatc agtgacggac cggtccgtat 42435
tcctgcaggc gaaccggtgt actctcatga tgagtgtcaa atgctggtta aacaaggccg 42495
ggctgatgag gtcatgctgc gctggtcaga ggtccgggag atgcacctca gtggccttgt 42555



tgagtttcac tcgcacacgc acacccaccg acgctgggac cagaagcctg tgtcccgtaa 42615 
tccgtcggat ttgcttcgtg tcgatattct tcttagtcgt aagcggatga gggagatgct 42675
gggttattgc agtcagcatc tgtgctggcc tgagggctgg tattgttctg actatattca 42735
tgtggctgaa gagttggggt tcacatacct gtataccaca gaaaggcgta tgaacaatcc 42795
agtcatcggt tcacagcgta ttggtcgtat caacgcaaag gagcgaaaga atgtgggctg 42855
gctgaaacgt cgtctgtttt atcacaccac gcccggattt tcttcgctgc tggcccggca 42915 
taagggggca cgtcggatag ctgactgagc cagagaccag gatgaaagtg ctgcataccg 42975
aatcctcccc catcatcggc gggcaggggt tgtaggctat atcccaaatg atggtgctga 43035
ttttgagcag aaataagata acaccggagg ccgtgataga cccggttgcg cagaatctgg 43095
aggcagcagg tcactggcgc agggcttcta cccgctggct tttggttatg ggagattttg 43155
aatgtacaga ggcccagcgg gagtggttgt tgttgcgccg gaattattgc cttgcgcaga 43215
tatcttcccc tggcgccagt aaagctggat atcagcgacg tgacgaaatg agaggagcct 43275 
gcacataggt ttgtgttttg agtaagtgtc ttatttaaac cgtctgttct gtttcctccg 43335
ctttcacaaa taatgtcgag ccgggtgggg gactcaagta agaataatct ggcgatgttt 43395 
tgcttgtttc cacgggatac tttgttaggt gaacgataca attaatgcgc tccagactgc 43455 
gtcacagaaa aggagaatgc ctttgcagaa cgggcgcatg acaatgaacc actttattat 43515
tggtatctgg ttcggcgaca accttgacga tcacgtctga gatagggtat ttacactgag 43575
tggtaaacag gttcaattag taaccggaga tggatgcaaa atcatgatcg attcagatgc 43635
tattttgcag ccaatagatt ttttattaag atgatatcaa ggacattgag gcacacaatg 43695
acgtatcagt aagtcgttga tagctcattt gatataagaa tttcttttat caacggaaga 43755
taatgatgga actgatcaat aatcgtggta tgcgagactg gatgatattt attaaagtgg 43815
cggaagtagg gaatctttcc cgggctgcgc gggaattaga tattagcatt tctgctgtca 43875
gtaaatcgct tagtcgcctt gagaattcta ttgaggttac tttacttcgg cgcgattcac 43935
atcacttaga actgactgga gctggtcaga cagcctatgc aagcatgaaa aggataacat 43995
cttcctttca gtccttgctg gatgaattgc gaaatccgga taaaattatc agagggagta 44055 
taaaattttc ggctccggct attgtctgtg agtttcttgc caataagtgg atatgggaat 44115
ttacagctag ctatccggat acaaaaatct acctggattc acgagagcgt agcgattttt 44175 
ttagtaaatc cctggagttt gatgagctgg tttttaaaag tggcataatc gaaagtgagg 44235 
atctcgtgta tcgaaagata agccctttaa agttggttct ttgtgcgagt ccgaaatata 44295 
tcagaaaata tggcaggatc tcacaccctg gcgatttgga aaatcacatt attgtgggtc 44355



ttcacaacca tggtctttcc ggacctctta ctcttttccg tcaggatgaa tcatacacta 44415 
ttagtggcgc tgttaatgtt catttatctt ccaataatct tttgagtgtt cttaatttgg 44475 
ttttagaagg aaagggtatc aacctcatga ctccggcctg gcttgccacc aaatacttaa 44535 
aaaataatga acttgaaatt atacttcctg aatggagggt tccagatctc cccatttatc 44595 
ttgtatggcg tcatcgtcag tattattctc ctttatttca acgctttctg tcttttattg 44655 
aagataaatg gaataatcgc ccacaaattg attttctgaa tgatgattaa cccgtttgga 44715 
atggttttga tacgttcctg acttaaaccc acatgatgac tgaattgagg catcgagata 44775 
tgcgactggt cagccagtcg tcttttgacg atgcccaata caaaacatga tgccgactga 44835 
cggaaatgat aatacgcgga aacaggacgg ggctgttttt gggcagccgg aagttaagcc 44895 
cataccagaa acgttgcagt gtactgaaaa atggcgccag gttgcacctg ttcaaagatt 44955



ttctgaaggc gcaggagtat tcattactga tatcctccat tgcgccttcg ggaacccaca 45015
ggaccagcta ttttaccgat agtgtttaaa aggcgtaagt aatgccgagc atgaagtcat 45075 
tggaggcagc ctttgtgtct gcatcataag cggtatgttc atcaccagca tagtgatttt 45135 
ttgaaatgct tactttgcca gcattaatgt atttataact ggcgtcaatc ataatattat 45195 
ctgttacagc atattttgca ccgatacctg cgccccaggc aaagttattt tttgaagcag 45255 
acagagtttc attaatacca aaaccaacag gaatggtgtt attacttagc ttcacatgag 45315 
cgaggccaac gcctgcgctg atatagggag taaatgccgt actattgtga aaatcataat 45375
agccattaac catgtaagtg gtcattcgga cctgattttt tacatttatg tgtactggat 45435 
caccaaatgc aataatatcc tgcccgcctt tagcatccgt ctcacctctg aaagtggtat 45495
ccagttctaa acgtactgga agctggaatg gatcataaaa gtcataaccg atagcaaccc 45555 
cgccgccaaa aacgcctttg gtacggtcag gtaacgttgc atgaccatta actatctcat 45615 
cctggctgaa ggttgagttg attccataga cattgactac ggatgtcccc gctttcccgg 45675
tgatatagat cccttctttt gctgatgcag tagcggacca ggctaccaca aggggaatga 45735
tgcagactgc gaaaaagttt ttcatttcag aacctgcctt aatattgggc taaaagacaa 45795 
gtttcacggt atagggtgtg atataacgat tacataaacg aagcccaaaa aacggtctat 45855
tgtaacgctg ggttttctgt aagcgggtaa aaaatgagat gaagatttta aataacaata 45915
cgataatcgt cggtatggaa atccatctcc tcgccaaatt gccccacgta cggtttcact 45975
tctacgttat gtaacgggta gtgtgagatg gagcgatgct gtaagaaaaa gatgaagatg 46035 
aatttgtacc cgacctggat aaagcccgtt atcccggaat aacgggcaaa aatatttact 46095
caagtgcctg ggcgagatct tgttgtacct gttgacgctg ttctggtgtt aagactttgc 46155



ttaaatcaaa ataatattta acccgataat agcgagcctg ttgttctatg ttactgaagg 46215 
ctgcaagctg ctgttttacg gcggcgtcat cccatttacc ggatttaatc acctctatca 46275 
gcgcaccgtc tttaattccc ttcatagaaa tctgactgac gtcggtttcc agttgttggt 46335 
gaagtttttt gatccgggta atctgatcgt ttgtcagctt cagatgctgg acaataggat 46395 
cctgggcggg caggggagga ttggggacag cggtggcgaa agcgccaaaa gaaacgcccg 46455 
ccagagtcgc tgccagtaaa gttgtgcgta caaagttttt catgaagata tcctgataag 46515
ggagtgatta accgttttta ttacccacga atggcgagca attatcttag agcctatccc 46575 
agtagggcta ttttacttgc cattttggac ctgggcagtg ctcgccaaaa cgcgttagcg 46635 
ttttgaacgc cgctagcggc ggcccgaagg gcgagcgtag cgagtcaaac ctcacgtact 46695
acgtgtacgc tccggttttt gcgcgctgtc cgtgtccaaa ctggctgcgc caataacgcc 46755



tggtgggata ggctcttagt cagaatacgt tgcctgccac attacgccac gcgaatttgt 46815 
tttacggaga gttacggagt gaaacaatcc cgccgcggtg agcggcaggt tgctt 46870



<210> 2 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium
<400> 2

Met Lys Ser Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met
1 5 10 15

Ala Ala Ser Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln
20 25 30

Lys Ser Val Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser
35 40 45

Leu Ile Pro Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala
50 55 60

Lys Ile Ala Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly
65 70 75 80

Val Arg Gly Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp
85 90 95

Arg Val Ala Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser
100 105 110

Ser Asp Ser Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val
115 120 125

Asn Trp Met Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr
130 135 140

Gly Pro Ala Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp
145 150 155 160

Val Val Gly Tyr Gln Pro
165



<210> 3 
<211> 245 
<212> PRT 

<213> Salmonella typhimurium

<400> 3

Met Lys Ile Val Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr
1 5 10 15

Asn Ser Met Val Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala
20 25 30

Thr Lys Leu Phe Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His
35 40 45

Ala Gly Thr Ala Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr
50 55 60

Pro Ile Leu Val Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro
65 70 75 80

Ala Pro Phe Leu Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln
85 90 95

Gln Ser Gln Leu Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp
100 105 110

Arg Glu Thr Leu Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn
115 120 125

Glu Pro Ser Asp Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu
130 135 140

Ser Ile Asn Ala Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys
145 150 155 160

Gly Thr Pro Glu Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly
165 170 175

Asn Lys Leu Lys Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala
180 185 190

Ser Val Thr Val Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro
195 200 205

Pro Phe Ala Asp Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp
210 215 220

Ile Glu Trp Arg Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe
225 230 235 240

His Tyr Val Leu Lys
245



<210> 4 
<211> 836 
<212> PRT 

<213> Salmonella typhimurium
<400> 4

Met Lys Phe Lys Gln Pro Ala Leu Leu Leu Phe Ile Ala Gly Val Val
1 5 10 15

His Cys Ala Asn Ala His Thr Tyr Thr Phe Asp Ala Ser Met Leu Gly
20 25 30

Asp Ala Ala Lys Gly Val Asp Met Ser Leu Phe Asn Gln Gly Leu Gln
35 40 45

Gln Pro Gly Thr Tyr Arg Val Asp Val Met Val Asn Gly Lys Arg Val
50 55 60

Asp Thr Arg Asp Val Val Phe Lys Leu Glu Lys Asp Gly Gln Gly Thr
65 70 75 80

Pro Val Leu Ala Pro Cys Leu Thr Val Ser Gln Leu Ser Arg Tyr Gly
85 90 95



Val Lys Thr Glu Asp Tyr Pro Gln Leu Trp Lys Ala Ala Lys Pro Pro
100 105 110

Asp Glu Cys Ala Asp Leu Thr Ala Ile Pro Gln Ala Lys Ala Val Leu
115 120 125

Asp Ile Asn Asn Gln Gln Leu Gln Leu Ser Ile Pro Gln Leu Ala Leu
130 135 140

Arg Pro Glu Phe Lys Gly Ile Ala Pro Glu Asp Leu Trp Asp Asp Gly
145 150 155 160

Ile Pro Ala Phe Leu Met Asn Tyr Ser Ala Arg Thr Thr Gln Thr Asp
165 170 175



Tyr Lys Met Asp Met Val Gly Arg Asp Asn Ser Ser Trp Val Gln Leu
180 185 190

Gln Pro Gly Ile Asn Ile Gly Ala Trp Arg Val Arg Asn Ala Thr Ser
195 200 205

Trp Gln Arg Ser Ser Gln Leu Ser Gly Lys Trp Gln Ala Ala Tyr Thr
210 215 220

Tyr Ala Glu Arg Gly Leu Tyr Ser Leu Lys Ser Arg Leu Thr Leu Gly
225 230 235 240

Gln Lys Thr Ser Gln Gly Glu Ile Phe Asp Ser Val Pro Phe Thr Gly
245 250 255

Val Met Leu Ala Ser Asp Asp Asn Met Val Pro Tyr Ser Glu Arg Gln
260 265 270

Phe Ala Pro Val Val Arg Gly Ile Ala Arg Thr Gln Ala Arg Val Glu
275 280 285



Val Lys Gln Asn Gly Tyr Thr Ile Tyr Asn Thr Thr Val Ala Pro Gly
290 295 300

Pro Phe Ala Leu Arg Asp Leu Ser Val Thr Asp Ser Ser Gly Asp Leu
305 310 315 320

His Val Thr Val Trp Glu Ala Asp Gly Ser Thr Gln Met Phe Val Val
325 330 335

Pro Tyr Gln Thr Pro Ala Ile Ala Leu His Gln Gly Tyr Leu Lys Tyr
340 345 350

Ser Leu Leu Ala Gly Arg Tyr Arg Ser Ser Asp Ser Ala Thr Asp Lys
355 360 365

Arg Gln Ile Ala Gln Ala Thr Leu Met Tyr Gly Leu Pro Trp Asn Leu
370 375 380

Thr Ala Tyr Gly Gly Ile Gln Ser Ala Thr His Asn Gln Ala Ala Leu
385 390 395 400

Leu Gly Leu Gly Gly Ser Leu Gly Arg Trp Gly Ser Leu Ser Val Asp
405 410 415

Gly Ser Asp Thr His Ser Gln Arg Gln Gly Glu Ala Val Gln Gln Gly
420 425 430

Ala Ser Trp Arg Leu Arg Tyr Ser Asn Gln Leu Thr Ala Thr Gly Thr
435 440 445



Asn Phe Phe Leu Thr Arg Trp Gln Tyr Ala Ser Gln Gly Tyr Asn Thr
450 455 460

Leu Ser Asp Val Leu Asp Ser Tyr Arg His Asn Gly Asn Arg Leu Trp
465 470 475 480

Ser Trp Arg Glu Asn Leu Gln Pro Ser Ser Arg Thr Thr Leu Met Leu
485 490 495

Ser Gln Ser Trp Gly Arg His Leu Gly Asn Leu Ser Leu Thr Gly Ser
500 505 510

Arg Thr Asp Trp Arg Asn Arg Pro Gly His Asp Asp Ser Tyr Gly Leu
515 520 525

Ser Trp Gly Thr Ser Ile Gly Gly Gly Ser Leu Ser Leu Asn Trp Asn
530 535 540

Gln Asn Arg Thr Leu Trp Arg Asn Gly Ala His Arg Lys Glu Asn Ile
545 550 555 560

Thr Ser Leu Trp Phe Ser Met Pro Leu Ser Arg Trp Thr Gly Asn Asn
565 570 575

Val Ser Ala Ser Trp Gln Met Thr Ser Pro Ser His Gly Gly Gln Thr
580 585 590

Gln Gln Val Gly Val Asn Gly Glu Ala Phe Ser Gln Gln Leu Asp Trp
595 600 605



Glu Val Arg Gln Ser Tyr Arg Ala Asp Ala Pro Pro Gly Gly Gly Asn
610 615 620

Asn Ser Ala Leu His Leu Ala Trp Asn Gly Asp Tyr Gly Leu Leu Gly
625 630 635 640

Gly Asp Tyr Ser Tyr Ser Arg Ala Met Arg Gln Met Gly Val Asn Ile
645 650 655

Ala Gly Gly Ile Val Ile His His His Gly Val Thr Leu Gly Gln Pro
660 665 670

Leu Gln Gly Ser Val Ala Leu Val Glu Ala Pro Gly Ala Ser Gly Val
675 680 685

Pro Val Gly Gly Trp Pro Gly Val Lys Thr Asp Phe Arg Gly Asp Thr
690 695 700

Thr Val Gly Asn Leu Asn Val Tyr Gln Glu Asn Thr Val Ser Leu Asp
705 710 715 720

Pro Ser Arg Leu Pro Asp Asp Ala Glu Val Thr Gln Thr Asp Val Arg
725 730 735

Val Val Pro Thr Glu Gly Ala Val Val Glu Ala Lys Phe His Thr Arg
740 745 750

Ile Gly Ala Arg Ala Leu Met Thr Leu Lys Arg Glu Asp Gly Ser Ala
755 760 765

Ile Pro Phe Gly Ala Gln Val Thr Val Asn Gly Gln Asp Gly Ser Ala
770 775 780

Ala Leu Val Asp Thr Asp Ser Gln Val Tyr Leu Thr Gly Leu Ala Asp
785 790 795 800

Lys Gly Glu Leu Thr Val Lys Trp Gly Ala Gln Gln Cys Arg Val Asn
805 810 815

Tyr Arg Leu Pro Ala His Lys Gly Ile Ala Gly Leu Tyr Gln Met Ser
820 825 830

Gly Leu Cys Arg
835



<210> 5 
<211> 156 
<212> PRT 

<213> Salmonella typhimurium
<400> 5 

Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr Ser Val Ser
1 5 10 15

Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn Ala Ala Glu
20 25 30

Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg Ala Gly His
35 40 45

Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr Tyr His Gly
50 55 60

Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys Ala Gly Asn
65 70 75 80

Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro Arg His His
85 90 95

Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp Thr Val Ser
100 105 110

Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala Ser Phe Ser
115 120 125

Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr Trp Thr Leu
130 135 140

Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
145 150 155



<210> 6 
<211> 9253 
<212> DNA 

<213> Salmonella typhi 

<220>
<221> CDS
<222> (1898)..(2608)
<223> tcfA putative fimbrial subunit

<220>
<221> CDS
<222> (2659)..(3234)
<223> tcfB putative fimbrial subunit

<220>
<221> CDS
<222> (3360)..(6029)
<223> tcfC putative fimbrial subunit

<220>
<221> CDS
<222> (6052)..(7131)
<223> tcfD putative fimbrial subunit

<220>
<221> CDS
<222> (7264)..(7719)
<223> tinR putative transcriptional regulator
<400> 6

tgtaagtatc ccgcataatc gagccattca catttagaga tcatccggca taatcaatct 60 
gccaacgcag gagatcgctg tgcgtaaagc ccgtattact gcgcaccaga tcatcgctgt 120



gattagatca gttgaatccg gacggactgt taaagatgtc taccgggagg ccggtatttc 180 
tgaagccacc agggacaact ggaagtctgg atacggcggc atggaagctt ctgatattaa 240
atcttgagga tgtcaacgcc aggatttatg gtcgtttttc gctattttta atatccgctg 300
tttgatcact tctgctgtcc gctttccgcc atttcatctt cactgattgg cgttgcgctt 360
ttggtcagcg ccccgacttt gcgtttttcc ttcagttggt aactttctcc tttgatattc 420 
agtgtggttg agtgatgcag tagccgatcc aggatcgctg ttgccagcac gttatcaccg 480 
aacatctctc cccagtctgc gaagcctttg tttgacgtca ggataatgct cgctttttca 540
taccgtcggt tcagcagtcg gaagaacaga ctggcttcct cactggtcat tggcaggtaa 600
cctatttcgt ccaggatcag cccccgcgca tagctcagtt gttgtagctg gcgctccagg 660 
cggttttcca gcttcgcttt catcagcgtc taccaacctg tcctgcggca tgaacaacac 720 



ccgatggccc acatctgccg ctttcacacc gggggcagcg gccaggtggg ttttctccac 780

tccgggggcc ccagcaggat cacattctcg cagcgctcca cgaacgccag accggccaac 840 

tcccggacga ccttacgatc gatgcctggc tggaagctga agtcgaactg ctccagcgtt 900

ttgactcatg gcagacgagc ctgtttcagc cgggattcca ttccgcgctg atgtctgccg 960 

ttctattcct gctgcagcgc catgcacagg aatgtcgact tattttgggc tgacagaggc 1020 

attggtcgtt aagacttgta tattcatgtt gcttactcgc acaaaagggg gaaaggaacc 1080 

gtaaacaaca ttgacaggta agggactggc atcaatttga atggtatttt ttataacatt 1140 

atgttttaaa aggtgaattg tattttcatg gtgagttgtt ctgttttata ttgttgtttg 1200 

tggaatattt agctaatatt taatctaaaa atagtgtatt atattgatta cactttgctg 1260 

gagaggtgta aacatcaagc gtaaccatat gatgtatata agtttttgtt tgctgatatc 1320 

atatttttaa cagatattga gcaagtaaat ctgatctaac tgtaacttta gttttctaaa 1380 

ttaattactt tgcttacttt tttttacctg gttttgtaaa acctcatgat gatcagtcta 1440 

tttgtggctt tgtttaaggt ttatttggtt ttgtagtctg aattgtatct ccctcctgga 1500 

gggctttctt tagattctct gtctcctacg ttgttatgtt acgtatttgt tgctttgaag 1560 

gagggggaaa tacagttcca tttatctgag taagtcaggt acacagtaac aactttctta 1620 

tgaagaattt ccaaaatttt tactgcggcg ttattaattg ttcagcgatt cttacagatc 1680 

tgtcgttcgc ttttggtgaa tgaaatccgt ggacttttat ttactaattt tttctttcct 1740 

gaaaaaaaca gaggtattga gcgaaaaatt ttattccgta tgatgccctc cacacaaaat 1800 

gtattaacac tgaatcgtaa tttgcttctt tatgctgata actttctgtc tatgctaata 1860 

ctaaaattta gatgactttt atacggtaaa atctggt atg aat ttt aaa gat act 1915 

Met Asn Phe Lys Asp Thr 
ctt ccc ggg gtg ttt ctc tgt gtc gct atg ttt gca tgt ggt cat gcc 1963
Leu Pro Gly Val Phe Leu Cys Val Ala Met Phe Ala Cys Gly His Ala
10 15 20

agg gcg aat atg ctc gtt tat ccc atg gcg gca gaa att aat agt agc 2011
Arg Ala Asn Met Leu Val Tyr Pro Met Ala Ala Glu Ile Asn Ser Ser
25 30 35

cgc gaa gag gcc acc tcg ctg ttc gtc tat tct aaa tca gat cat gtg 2059
Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr Ser Lys Ser Asp His Val
40 45 50

caa tat att cga aca aga atc atg cgt att gaa cac ccc ggt atg cca 2107
Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile Glu His Pro Gly Met Pro
55 60 65 70

cag gag aag gag gta cca gca ggg aat gat ata gag aca gga ctt gtt 2155
Gln Glu Lys Glu Val Pro Ala Gly Asn Asp Ile Glu Thr Gly Leu Val
75 80 85

gtc tcc ccg gag aaa ttt gct ctt tcc ccg gga aca aaa aaa aca ata 2203
Val Ser Pro Glu Lys Phe Ala Leu Ser Pro Gly Thr Lys Lys Thr Ile
90 95 100

cgt gtt atc agt act cag gca ccg gaa aga gag gaa gcc tgg cgg gta 2251
Arg Val Ile Ser Thr Gln Ala Pro Glu Arg Glu Glu Ala Trp Arg Val
105 110 115

tac ttc gag gct gtt cct gaa ctg gaa gat gat cca cag gca ggc gga 2299
Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp Asp Pro Gln Ala Gly Gly
120 125 130

aag caa aat tca tcc gta agt gtg aat ctt gtc tgg ggg gtg ttg ctg 2347
Lys Gln Asn Ser Ser Val Ser Val Asn Leu Val Trp Gly Val Leu Leu
135 140 145 150

cgt gtt tct ccg tca gac ccc agg cct gcg ctg gta acg gac ggt cac 2395
Arg Val Ser Pro Ser Asp Pro Arg Pro Ala Leu Val Thr Asp Gly His
155 160 165

cac ctg ctg aat acg gga aac aca cgg ctt tct ctt att cgg gct ggc 2443
His Leu Leu Asn Thr Gly Asn Thr Arg Leu Ser Leu Ile Arg Ala Gly
170 175 180

aac tgc gac acc aca tgc cac tgg cag aat ata ggc aaa agt att tat 2491
Asn Cys Asp Thr Thr Cys His Trp Gln Asn Ile Gly Lys Ser Ile Tyr
185 190 195

ccc ggc ggg agt gct gat att ccg gcc gga ata aaa agt aat gca ttt 2539
Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly Ile Lys Ser Asn Ala Phe
200 205 210

cgt gtg gaa tat cgt acg ggt gca aat tca ccg gta atc tct gct gat 2587
Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser Pro Val Ile Ser Ala Asp
215 220 225 230

tta aca gca gcc gga aag taa aaacacacgg agcgtacgct ataccctaca 2638
Leu Thr Ala Ala Gly Lys
235

tttattctca gggggagcgg atg tat acc gag tgt aca tat atc act gta ata 2691
Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile
240 245

aac aac aaa gca agg tta ttt ttt atg aac atg aaa aca tct ttt att 2739
Asn Asn Lys Ala Arg Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile
250 255 260

gcc gca gct gtg gca ttg gcc acc gtt tat tct ttt tct gtt tct gcg 2787
Ala Ala Ala Val Ala Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala
265 270 275 280

gtt cag aag gat att acc gtc act gcc aat att gac agt aca ctt gaa 2835
Val Gln Lys Asp Ile Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu
285 290 295

ctg ctg cag gcc gat ggt tca tcc ctc ccg tcg act atg aag ctg gat 2883
Leu Leu Gln Ala Asp Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp
300 305 310

ttc atg ccg ggt aag ggc ctg gtc cat aaa tca ctc cag acc cgc ctt 2931
Phe Met Pro Gly Lys Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu
315 320 325

tac agc aac gat cag acc aag tcg gtt aat gta aaa ctg ttg aat gct 2979
Tyr Ser Asn Asp Gln Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala
330 335 340

cca caa ctt atc aac gtc ctg gat ccc acc aaa acc att gat atg gaa 3027
Pro Gln Leu Ile Asn Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu
345 350 355 360

gtg act ctg gga gga cgg tca ctg acc acc acc aat tct gta ctg gaa 3075
Val Thr Leu Gly Gly Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu
365 370 375

gct aaa acc ctg ttc ccg gac gga aaa act ggc gat gct tca gct ctg 3123
Ala Lys Thr Leu Phe Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu
380 385 390

ctg aac ctg gat att ggt cag aag gct gga gca gcc tta caa aac ctg 3171
Leu Asn Leu Asp Ile Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu
395 400 405

cct gcc ggt gaa tac agc gga ttg gtc agt ctg gtg att tca cag gct 3219
Pro Ala Gly Glu Tyr Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala
410 415 420

gtc act gcc ggc taa taactggtta ttagctcttc atctgatccg gttttggggg 3274
Val Thr Ala Gly
425

gcaccgttcg tacctgaacc ggatccggta ttgatcttat tattcattgc aattcaggtc 3334

tctttacgtg agtcgttatt tctgg atg tat tat tta ctg gga ttg tgc agt 3386
Met Tyr Tyr Leu Leu Gly Leu Cys Ser
430 435



ttt acc agc cag gca act ctt att ccc cct cct gga ttt gaa tct ctg 3434
Phe Thr Ser Gln Ala Thr Leu Ile Pro Pro Pro Gly Phe Glu Ser Leu
440 445 450

ctg gaa gga cag act gag caa att gaa gtg ttg cta cca ggg cat tca 3482
Leu Glu Gly Gln Thr Glu Gln Ile Glu Val Leu Leu Pro Gly His Ser
455 460 465 470

ctg gga tta ttt ccg gtg gtg gtt aaa ccg gac acc gtg cag ttc atg 3530
Leu Gly Leu Phe Pro Val Val Val Lys Pro Asp Thr Val Gln Phe Met
475 480 485

tcc cca ttg atg gta ctt gaa agc agt ggg ctt gcc gcg ttg ccg gcc 3578
Ser Pro Leu Met Val Leu Glu Ser Ser Gly Leu Ala Ala Leu Pro Ala
490 495 500

gca gaa cgg caa aaa gcg ctg gct gca ctc agc cgt ccg ttg cta cgt 3626
Ala Glu Arg Gln Lys Ala Leu Ala Ala Leu Ser Arg Pro Leu Leu Arg
505 510 515

aac agc aat ctg gtc tgt ggt gtc tca gaa gca aaa gac agc agc gag 3674
Asn Ser Asn Leu Val Cys Gly Val Ser Glu Ala Lys Asp Ser Ser Glu
520 525 530

tgt ggt tac gtg gca aca gat aaa gag gat gtt gcg gtt att ttt gat 3722
Cys Gly Tyr Val Ala Thr Asp Lys Glu Asp Val Ala Val Ile Phe Asp
535 540 545 550

gag aac aac gct cag tta tct ttg ttt ctt aac cgg gac tgg ttg ccg 3770
Glu Asn Asn Ala Gln Leu Ser Leu Phe Leu Asn Arg Asp Trp Leu Pro
555 560 565

gat gaa gaa cga cgt gat aaa cgc tgg ctg act ccg acc ccg gag ggt 3818
Asp Glu Glu Arg Arg Asp Lys Arg Trp Leu Thr Pro Thr Pro Glu Gly
570 575 580

gtc agc gca ttt att cac cgc cag acg ctg tat ctg agt gat gat ctc 3866
Val Ser Ala Phe Ile His Arg Gln Thr Leu Tyr Leu Ser Asp Asp Leu
585 590 595

cac agt cgt aat atg aca ctg aat ggt agc ggt gcc ctg ggg ctt ggt 3914
His Ser Arg Asn Met Thr Leu Asn Gly Ser Gly Ala Leu Gly Leu Gly
600 605 610

gac ggt cgt tat ctg gga ggc gac tgg gcg gct atc tgg aat cag ttTa 3962
Asp Gly Arg Tyr Leu Gly Gly Asp Trp Ala Ala Ile Trp Asn Gln Ser
615 620 625 630

gaa cat tac aat aac agt cag gcc tgg ttt gac aat ctg ttt gtc cgt 4010
Glu His Tyr Asn Asn Ser Gln Ala Trp Phe Asp Asn Leu Phe Val Arg
635 640 645

cag gat ctc ggc aat cag tat tat ctc cag gct ggt cgg atg gat cag 4058
Gln Asp Leu Gly Asn Gln Tyr Tyr Leu Gln Ala Gly Arg Met Asp Gln
650 655 660

cgg aat ctg tcc agc gcc acg ggg ggg gat ttt ggg ttc agt ctg ctt 4106
Arg Asn Leu Ser Ser Ala Thr Gly Gly Asp Phe Gly Phe Ser Leu Leu
665 670 675
ccc ctg agc cgg ttt gat gga tta cga acc ggg acc acc caa gct tat 4154
Pro Leu Ser Arg Phe Asp Gly Leu Arg Thr Gly Thr Thr Gln Ala Tyr
680 685 690

gtt aac cat gag gtg gac cat aat gcc act ccg gtt atg gtt cag gtt 4202
Val Asn His Glu Val Asp His Asn Ala Thr Pro Val Met Val Gln Val
695 700 705 710

acc cga aat gcc cgt att gat att tat cgt ggc agc gag ttg ctg ggg 4250
Thr Arg Asn Ala Arg Ile Asp Ile Tyr Arg Gly Ser Glu Leu Leu Gly
715 720 725

agt cag ttc ctg acc ccg gga atg cat acc ctg gat act cat tct ctt 4298
Ser Gln Phe Leu Thr Pro Gly Met His Thr Leu Asp Thr His Ser Leu
730 735 740

cca ccg gga agc tat cct ctg gcg ttg cgg gtg tat gag gat ggg att 4346
Pro Pro Gly Ser Tyr Pro Leu Ala Leu Arg Val Tyr Glu Asp Gly Ile
745 750 755

ctg cgg cga acg gag acc cag ccc ttc agt aag ggg ggc aat agc ttc 4394
Leu Arg Arg Thr Glu Thr Gln Pro Phe Ser Lys Gly Gly Asn Ser Phe
760 765 770

agt gca cag acc cag tgg ttt att cag ggc ggg ctg gaa gat acc ggg 4442
Ser Ala Gln Thr Gln Trp Phe Ile Gln Gly Gly Leu Glu Asp Thr Gly
775 780 785 790

gat aaa gcc agc cat tat gac ggt gag act gtc atg gct gcc gga ttc 4490
Asp Lys Ala Ser His Tyr Asp Gly Glu Thr Val Met Ala Ala Gly Phe
795 800 805

caa act ggg ctg cgg aaa aat atc agt ctg acc gaa ggt atc tct ctg 4538
Gln Thr Gly Leu Arg Lys Asn Ile Ser Leu Thr Glu Gly Ile Ser Leu
810 815 820

gca cat gag gcc tgg tac agt gaa acc cga ctg aat tca cag cat gca 4586
Ala His Glu Ala Trp Tyr Ser Glu Thr Arg Leu Asn Ser Gln His Ala
825 830 835

gtg ctg gat ggc acg ctg gac ctt tct gcc ggg ata ctg cat ggg aca 4634
Val Leu Asp Gly Thr Leu Asp Leu Ser Ala Gly Ile Leu His Gly Thr
840 845 850

gac agc acg agc ggt aac act gag cag gtg aca tac aac gac gga ttt 4682
Asp Ser Thr Ser Gly Asn Thr Glu Gln Val Thr Tyr Asn Asp Gly Phe
855 860 865 870

tcc gcg agt ctg tgg cgt aac cat acg gaa agt gat gcc tgt agt ggt 4730
Ser Ala Ser Leu Trp Arg Asn His Thr Glu Ser Asp Ala Cys Ser Gly
875 880 885

cgt cat cca cag tca gtg cat gcc agt atg acc tgc cag act tcg atg 4778
Arg His Pro Gln Ser Val His Ala Ser Met Thr Cys Gln Thr Ser Met
890 895 900

aac gcc tcc ctg tcg gtt tcg gtg ggg aac tgg tat gcc cta ctg gga 4826
Asn Ala Ser Leu Ser Val Ser Val Gly Asn Trp Tyr Ala Leu Leu Gly
905 910 915

tac agt acc agc agg aca gaa ggt cgg ccg gtt tac cgg gga tat gat 4874
Tyr Ser Thr Ser Arg Thr Glu Gly Arg Pro Val Tyr Arg Gly Tyr Asp
920 925 930

gat aac agt gac aaa gaa aat gtg ttc tgg cga cag gca tac atc cct 4922
Asp Asn Ser Asp Lys Glu Asn Val Phe Trp Arg Gln Ala Tyr Ile Pro
935 940 945 950

gcc tct cac cgc gaa tct gct cag gct agt gca acg tac agc ctt aat 4970
Ala Ser His Arg Glu Ser Ala Gln Ala Ser Ala Thr Tyr Ser Leu Asn
955 960 965

atg gct ggc atg aat att aat acc cat ggg gga gta tgg cga acc cga 5018
Met Ala Gly Met Asn Ile Asn Thr His Gly Gly Val Trp Arg Thr Arg
970 975 980

aat gac gga gtg aat gat gat ggc ttg ttt atg agt gtc agt gtg tca 5066
Asn Asp Gly Val Asn Asp Asp Gly Leu Phe Met Ser Val Ser Val Ser
985 990 995

tat gcc tct caa cca ccg aca atg act ggc agt aat agg tat acc tca 5114
Tyr Ala Ser Gln Pro Pro Thr Met Thr Gly Ser Asn Arg Tyr Thr Ser
1000 1005 1010

gcc ggg acc gat att cac agt agc cgg aat caa aaa aca cag acg tcc 5162
Ala Gly Thr Asp Ile His Ser Ser Arg Asn Gln Lys Thr Gln Thr Ser
1015 1020 1025 1030

tgg aat gtg aac cat gtg aga tcc tgg cag cag gat ctg tat cgt gaa 5210
Trp Asn Val Asn His Val Arg Ser Trp Gln Gln Asp Leu Tyr Arg Glu
1035 1040 1045



ctg tcg gtg ggt ttc tcc ggt tat aac gac gac agc tgg agc ggg agt 5258
Leu Ser Val Gly Phe Ser Gly Tyr Asn Asp Asp Ser Trp Ser Gly Ser
1050 1055 1060

ctc ggc gga cgc atg agc ggc cgt atg ggt gaa ctg agc gcc act atc 5306
Leu Gly Gly Arg Met Ser Gly Arg Met Gly Glu Leu Ser Ala Thr Ile
1065 1070 1075

agt aac tcc cat caa cgt aat gcg ggc agc gcc agt tca ctc acc gct 5354
Ser Asn Ser His Gln Arg Asn Ala Gly Ser Ala Ser Ser Leu Thr Ala
1080 1085 1090

ggc tac agc tcg tct ctg gcg tta tcc cgt aat gga ctg ttc tgg gga 5402
Gly Tyr Ser Ser Ser Leu Ala Leu Ser Arg Asn Gly Leu Phe Trp Gly
1095 1100 1105 1110

ggt ggt cag gac ggt gaa ccg gcc tct ggc atg gcg gtg aac gtg gag 5450
Gly Gly Gln Asp Gly Glu Pro Ala Ser Gly Met Ala Val Asn Val Glu
1115 1120 1125

tca gag ggg gac gag ggc agt agc ggg aaa gta gtc agc gtt cgt ggc 5498
Ser Glu Gly Asp Glu Gly Ser Ser Gly Lys Val Val Ser Val Arg Gly
1130 1135 1140

agc agc cag ccg ttc agt ctc ggt ttt ggt cag cag tcg ctg ttg ctg 5546
Ser Ser Gln Pro Phe Ser Leu Gly Phe Gly Gln Gln Ser Leu Leu Leu
1145 1150 1155

atg gaa ggc tat aac gcc acg gag gtg acc att gag gat gca ggg gtt 5594
Met Glu Gly Tyr Asn Ala Thr Glu Val Thr Ile Glu Asp Ala Gly Val
1160 1165 1170

agt tca cag ggt atg gca ggc gta aaa gcg gga ggg gga agc agg tgt 5642
Ser Ser Gln Gly Met Ala Gly Val Lys Ala Gly Gly Gly Ser Arg Cys
1175 1180 1185 1190

tac ttc ctg aca ccc ggg cat ctg ctg gtt cac aac atc agc gcc agt 5690
Tyr Phe Leu Thr Pro Gly His Leu Leu Val His Asn Ile Ser Ala Ser
1195 1200 1205

atg agc cga ctg tac gtt ggc cgc gta ctg gac aag gat ggc aga ccg 5738
Met Ser Arg Leu Tyr Val Gly Arg Val Leu Asp Lys Asp Gly Arg Pro
1210 1215 1220

ctg ctg gac gca cag cca ctg aac tat cca ttt ttg tcg ttg gga cct 5786
Leu Leu Asp Ala Gln Pro Leu Asn Tyr Pro Phe Leu Ser Leu Gly Pro
1225 1230 1235

tcc ggg cga ttt agc ctg cag agc gag cat aaa gaa tcc agc ctg tgg 5834
Ser Gly Arg Phe Ser Leu Gln Ser Glu His Lys Glu Ser Ser Leu Trp
1240 1245 1250

ctg ctg tct aaa aac agg atc ctg cgt tgt ccg atg tca gta cat aaa 5882
Leu Leu Ser Lys Asn Arg Ile Leu Arg Cys Pro Met Ser Val His Lys
1255 1260 1265 1270

cgt cgg gat gtt atg cag gta gtg ggt gat gtg cgg tgt gaa tta agt 5930
Arg Arg Asp Val Met Gln Val Val Gly Asp Val Arg Cys Glu Leu Ser
1275 1280 1285

gac gtg gat gcc ctg cca cag gcg ttg caa ata tcg ccg cgg gtc atc 5978
Asp Val Asp Ala Leu Pro Gln Ala Leu Gln Ile Ser Pro Arg Val Ile
1290 1295 1300

cgt ttg ctg aac gtg gca ggt ttg ctg cgc cat tcc gtt cag gaa gcc 6026
Arg Leu Leu Asn Val Ala Gly Leu Leu Arg His Ser Val Gln Glu Ala
1305 1310 1315

tga cgtagagata aaggcgttaa ct atg agt aat aaa atg aag tgg acg agt 6078
Met Ser Asn Lys Met Lys Trp Thr Ser
1320 1325



atg aca gcc cat tgg tca gca att att aat ttc atc cga aaa tat gtt 6126
Met Thr Ala His Trp Ser Ala Ile Ile Asn Phe Ile Arg Lys Tyr Val
1330 1335 1340

tat cca gca agg ata att gcc atc ctg ctg atg gct ggc gct aca ctg 6174
Tyr Pro Ala Arg Ile Ile Ala Ile Leu Leu Met Ala Gly Ala Thr Leu
1345 1350 1355 1360

cca caa gtc gcc gat gcg att acc gtc gac ctg aat tac gac aag aac 6222
Pro Gln Val Ala Asp Ala Ile Thr Val Asp Leu Asn Tyr Asp Lys Asn
1365 1370 1375

aat gta gcg gtc atc act cct gtc tgg tcc caa gaa tgg agt gta gca 6270
Asn Val Ala Val Ile Thr Pro Val Trp Ser Gln Glu Trp Ser Val Ala
1380 1385 1390

aat gtg ttg ggg gga tgg gta tgt cgt tca aac agg aat gaa aat gag 6318
Asn Val Leu Gly Gly Trp Val Cys Arg Ser Asn Arg Asn Glu Asn Glu
1395 1400 1405

ggg gcg tgt gaa gaa aca cat ttg gta tgg tgg tat gct ttt gga gct 6366
Gly Ala Cys Glu Glu Thr His Leu Val Trp Trp Tyr Ala Phe Gly Ala
1410 1415 1420

tat tca aaa att cgt ctg cgt ttc aga gaa caa atc agc cat gcc gaa 6414
Tyr Ser Lys Ile Arg Leu Arg Phe Arg Glu Gln Ile Ser His Ala Glu
1425 1430 1435 1440

att acg ctc ata ctg ctc ggc agt gtt cgt gat gcc tgt tat act ggt 6462
Ile Thr Leu Ile Leu Leu Gly Ser Val Arg Asp Ala Cys Tyr Thr Gly
1445 1450 1455

gtc atc aac atg aac gct gct gca tgt caa tgg ggt agg tcg ctg aaa 6510
Val Ile Asn Met Asn Ala Ala Ala Cys Gln Trp Gly Arg Ser Leu Lys
1460 1465 1470

ctt agg ata cct tca gaa gag ctt gcg aag ata cct aca agc gga aca 6558
Leu Arg Ile Pro Ser Glu Glu Leu Ala Lys Ile Pro Thr Ser Gly Thr
1475 1480 1485

tgg aaa gca acg tta gtc ctg gat tat tta caa tgg ggc gga gac gat 6606
Trp Lys Ala Thr Leu Val Leu Asp Tyr Leu Gln Trp Gly Gly Asp Asp
1490 1495 1500

cct tta ggc aca tca act aca gat atc acg ctg aat gta aca gac cac 6654
Pro Leu Gly Thr Ser Thr Thr Asp Ile Thr Leu Asn Val Thr Asp His
1505 1510 1515 1520

ttt gct gaa aat gcg gct att tac ttt ccg caa ttt ggt aca gca acg 6702
Phe Ala Glu Asn Ala Ala Ile Tyr Phe Pro Gln Phe Gly Thr Ala Thr
1525 1530 1535

ccc cgg gtg gac ctg aat ctt cac cgg atg aat gcc tca caa atg tcg 6750
Pro Arg Val Asp Leu Asn Leu His Arg Met Asn Ala Ser Gln Met Ser
1540 1545 1550

ggc agg gct aat ctg gat atg tgt ctg tat gac gga ggt gtg aaa gcc 6798
Gly Arg Ala Asn Leu Asp Met Cys Leu Tyr Asp Gly Gly Val Lys Ala
1555 1560 1565

cgt tca tta cag atg aag ata gaa gga agc aat aag tca ggt acg gga 6846
Arg Ser Leu Gln Met Lys Ile Glu Gly Ser Asn Lys Ser Gly Thr Gly
1570 1575 1580

ttt cag gtt ata aag agc gat tct gct gat acg att gat tat gcg gtc 6894
Phe Gln Val Ile Lys Ser Asp Ser Ala Asp Thr Ile Asp Tyr Ala Val
1585 1590 1595 1600

agt atg aat tat ggg gga cga agt att cct gtc acc cgt ggc gtg gag 6942
Ser Met Asn Tyr Gly Gly Arg Ser Ile Pro Val Thr Arg Gly Val Glu
1605 1610 1615



ttc agt ctg gat aac gtg gat aaa gca gca acg cgt ccg gtg gta ctt 6990
Phe Ser Leu Asp Asn Val Asp Lys Ala Ala Thr Arg Pro Val Val Leu
1620 1625 1630

ccc ggg caa cgg cag gcg gta cgt tgt gtg cca gtg ccc ctt acc ctg 7038
Pro Gly Gln Arg Gln Ala Val Arg Cys Val Pro Val Pro Leu Thr Leu
1635 1640 1645

aca aca caa ccc ttt aac atc aga gag aag cgt tct ggt gag tat cag 7086
Thr Thr Gln Pro Phe Asn Ile Arg Glu Lys Arg Ser Gly Glu Tyr Gln
1650 1655 1660



gga acg ctg aca gtg aca atg ctg atg gga aca caa acc ccc tga 7131
Gly Thr Leu Thr Val Thr Met Leu Met Gly Thr Gln Thr Pro
1665 1670 1675

cagtaattat ttattttatt gatatctttc ttatatggtt ttttaaatca gagttctctt 7191

tatatacttg ttttatttaa taaagagaat ctattcactt atgaaaatca atgcgtgagg 7251

ttctgctttc ct atg act gtg tat tta gat gat aaa gat aaa gaa tta ttg 7302
Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu
1680 1685 1690



aaa gaa atc caa aaa gat tgt gca caa act tta tgg caa ctt gca tat 7350
Lys Glu Ile Gln Lys Asp Cys Ala Gln Thr Leu Trp Gln Leu Ala Tyr
1695 1700 1705

aaa gtg gga ctt acg ccc aca cca tgt ttc aaa cgt tta aaa aaa ctt 7398
Lys Val Gly Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu
1710 1715 1720

aaa gac agg ggg gtt atc att ggt cag ttc gct tta ttg gat aag gaa 7446
Lys Asp Arg Gly Val Ile Ile Gly Gln Phe Ala Leu Leu Asp Lys Glu
1725 1730 1735 1740

aaa cta ggt ctt tca ctt aat gtc ttt att atg att aac ata tct gag 7494
Lys Leu Gly Leu Ser Leu Asn Val Phe Ile Met Ile Asn Ile Ser Glu
1745 1750 1755



gag caa tac gct agt att tct gag aaa ata aag tca atg cct gag gtt 7542
Glu Gln Tyr Ala Ser Ile Ser Glu Lys Ile Lys Ser Met Pro Glu Val
1760 1765 1770

att gcc ttt tat cga att tct gga tca ttt aat tat tta atg cat aca 7590
Ile Ala Phe Tyr Arg Ile Ser Gly Ser Phe Asn Tyr Leu Met His Thr
1775 1780 1785

gta ttt aca gat atg aac gat tac tat agt ttt tat gag aaa ata ata 7638
Val Phe Thr Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys Ile Ile
1790 1795 1800

tta act aat tct tca att agt gga tct gca tcg agc ttt gtt ctt gag 7686
Leu Thr Asn Ser Ser Ile Ser Gly Ser Ala Ser Ser Phe Val Leu Glu
1805 1810 1815 1820

caa ata aag gaa aca aac gaa ctg tca gtg tga aagtgtgatg tgtacttact 7739
Gln Ile Lys Glu Thr Asn Glu Leu Ser Val
1825 1830



gatttaatac attattatcc ttcttacgga acaacaacgg cagattgcgg ctgttgaaca 7799 
aggattttaa tcagcagtgg tgaaattaag cggcacagaa taacacagcg gaatatcaca 7859
tggttaaata tcaccccgtg catgtaacaa aaaaccgcat taaaacagat gatgttactg 7919
atatttattt cgttgaaccc ttctggaaaa aaggcga^aa ccacataatt gagtcattga 7979
tgttttttga agagttacaa aagtcattta atttattcaa ccataaatat gggttaaata 8039 
aatatatact caggatcccc tgggaatttg tgctcataca tatggaaagg atcagtaaat 8099
taaatagcgt cgggttattt gctgtttctg ttgactttaa taacaaccac aaatttctga 8159 
gcgagtacat caggagtcgc agagattatg gtatggaagt ttggtttgat ttttgtggta 8219 
aacattctta ttccagtgaa attaaaaacc ttggattctt ttttcaggct tgcgtagtgc 8279 
ctcgtgatcc taattttatt agtagtgttt atcattatca taagttccaa aagattcttg 8339
tcggggatat aaatgatgta gaacagaggg ccgtgtacca gaacgaagtt gattacatgt 8399
atggaatgca atggccatcg tcatatgacg gttttttctt tcgggatcat aaaaaaaatg 8459 
aaacttggtg tatataacag aaggagtgaa aatttgaatc aaaaatatct tatttatttt 8519
ttgtttaatt attgttttgt tttttattac gattaaatat aaagaacatc attgttcgtg 8579
cggtggggag gctggaagtt taggggatga ccgtttatca acaattttat tacagccacc 8639
atacgaatgg tttatatatg cactagatgt attattttag tttaatatat cgatggttgc 8699
tatttgcatt gatgatgttc cgttacatta aggaatatac atctgtatct cgttatacgc 8759 
acactcacat tactaatcat tattaatatg agtgtggttc ttgttttacg catgcatggt 8819
tgcatgtgac gttaaattta aatgagctga ctgtatgaat tctaaatact ttagagaggt 8879
gttttttgtc tcggtagttg ttatattatt attttatttg gtgttatttg cagccagtgc 8939 
tcatgctgaa ggcggtttca gatctggagg cattgggtta tttatgacgg gaacaagaga 8999
gatgctactg tagagataat aaattctgct aaagattccc caattcttgt gcattgacat 9059 
cctccacgtc ctgaagggcg tgggttcctg ctccaacggg ctgcctgact gcacgctcct 9119 
tccacaggca agcacggcgt gtcccgctct aaaatgttac gcgcgccgtt tacatcggcg 9179 
ttcgcagtat atcttcatac cagacacttg taagtatctc gcataatcgt gccattcaca 9239 
tttagagatc atac 9253 

<210> 7 

<211> 236 

<212> PRT 

<213> Salmonella typhi 



<400> 7 



Met Asn Phe Lys Asp Thr Leu Pro Gly Val Phe Leu Cys Val Ala Met
1 5 10 15

Phe Ala Cys Gly His Ala Arg Ala Asn Met Leu Val Tyr Pro Met Ala
20 25 30

Ala Glu Ile Asn Ser Ser Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr
35 40 45

Ser Lys Ser Asp His Val Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile
50 55 60

Glu His Pro Gly Met Pro Gln Glu Lys Glu Val Pro Ala Gly Asn Asp
65 70 75 80

Ile Glu Thr Gly Leu Val Val Ser Pro Glu Lys Phe Ala Leu Ser Pro
85 90 95

Gly Thr Lys Lys Thr Ile Arg Val Ile Ser Thr Gln Ala Pro Glu Arg
100 105 110

Glu Glu Ala Trp Arg Val Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp
115 120 125

Asp Pro Gln Ala Gly Gly Lys Gln Asn Ser Ser Val Ser Val Asn Leu
130 135 140

Val Trp Gly Val Leu Leu Arg Val Ser Pro Ser Asp Pro Arg Pro Ala
145 150 155 160

Leu Val Thr Asp Gly His His Leu Leu Asn Thr Gly Asn Thr Arg Leu
165 170 175

Ser Leu Ile Arg Ala Gly Asn Cys Asp Thr Thr Cys His Trp Gln Asn
180 185 190

Ile Gly Lys Ser Ile Tyr Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly
195 200 205

Ile Lys Ser Asn Ala Phe Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser
210 215 220

Pro Val Ile Ser Ala Asp Leu Thr Ala Ala Gly Lys
225 230 235



<210> 8 
<211> 191 
<212> PRT 

<213> Salmonella typhi 
<400> 8 

Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile Asn Asn Lys Ala Arg
1 5 10 15

Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile Ala Ala Ala Val Ala
20 25 30

Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala Val Gln Lys Asp Ile
35 40 45

Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu Leu Leu Gln Ala Asp
50 55 60

Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp Phe Met Pro Gly Lys
65 70 75 80

Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu Tyr Ser Asn Asp Gln
85 90 95

Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala Pro Gln Leu Ile Asn
100 105 110

Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu Val Thr Leu Gly Gly
115 120 125

Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu Ala Lys Thr Leu Phe
130 135 140

Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu Leu Asn Leu Asp Ile
145 150 155 160

Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu Pro Ala Gly Glu Tyr
165 170 175

Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala Val Thr Ala Gly
180 185 190



<210> 9 
<211> 889 
<212> PRT 

<213> Salmonella typhi 
<400> 9 

Met Tyr Tyr Leu Leu Gly Leu Cys Ser Phe Thr Ser Gln Ala Thr Leu
1 5 10 15

Ile Pro Pro Pro Gly Phe Glu Ser Leu Leu Glu Gly Gln Thr Glu Gln
20 25 30

Ile Glu Val Leu Leu Pro Gly His Ser Leu Gly Leu Phe Pro Val Val
35 40 45

Val Lys Pro Asp Thr Val Gln Phe Met Ser Pro Leu Met Val Leu Glu
50 55 60

Ser Ser Gly Leu Ala Ala Leu Pro Ala Ala Glu Arg Gln Lys Ala Leu
65 70 75 80

Ala Ala Leu Ser Arg Pro Leu Leu Arg Asn Ser Asn Leu Val Cys Gly
85 90 95

Val Ser Glu Ala Lys Asp Ser Ser Glu Cys Gly Tyr Val Ala Thr Asp
100 105 110

Lys Glu Asp Val Ala Val Ile Phe Asp Glu Asn Asn Ala Gln Leu Ser
115 120 125

Leu Phe Leu Asn Arg Asp Trp Leu Pro Asp Glu Glu Arg Arg Asp Lys
130 135 140

Arg Trp Leu Thr Pro Thr Pro Glu Gly Val Ser Ala Phe Ile His Arg
145 150 155 160

Gln Thr Leu Tyr Leu Ser Asp Asp Leu His Ser Arg Asn Met Thr Leu
165 170 175

Asn Gly Ser Gly Ala Leu Gly Leu Gly Asp Gly Arg Tyr Leu Gly Gly
180 185 190

Asp Trp Ala Ala Ile Trp Asn Gln Ser Glu His Tyr Asn Asn Ser Gln
195 200 205

Ala Trp Phe Asp Asn Leu Phe Val Arg Gln Asp Leu Gly Asn Gln Tyr
210 215 220

Tyr Leu Gln Ala Gly Arg Met Asp Gln Arg Asn Leu Ser Ser Ala Thr
225 230 235 240

Gly Gly Asp Phe Gly Phe Ser Leu Leu Pro Leu Ser Arg Phe Asp Gly
245 250 255

Leu Arg Thr Gly Thr Thr Gln Ala Tyr Val Asn His Glu Val Asp His
260 265 270

Asn Ala Thr Pro Val Met Val Gln Val Thr Arg Asn Ala Arg Ile Asp
275 280 285

Ile Tyr Arg Gly Ser Glu Leu Leu Gly Ser Gln Phe Leu Thr Pro Gly
290 295 300

Met His Thr Leu Asp Thr His Ser Leu Pro Pro Gly Ser Tyr Pro Leu
305 310 315 320

Ala Leu Arg Val Tyr Glu Asp Gly Ile Leu Arg Arg Thr Glu Thr Gln
325 330 335

Pro Phe Ser Lys Gly Gly Asn Ser Phe Ser Ala Gln Thr Gln Trp Phe
340 345 350

Ile Gln Gly Gly Leu Glu Asp Thr Gly Asp Lys Ala Ser His Tyr Asp
355 360 365

Gly Glu Thr Val Met Ala Ala Gly Phe Gln Thr Gly Leu Arg Lys Asn
370 375 380

Ile Ser Leu Thr Glu Gly Ile Ser Leu Ala His Glu Ala Trp Tyr Ser
385 390 395 400

Glu Thr Arg Leu Asn Ser Gln His Ala Val Leu Asp Gly Thr Leu Asp
405 410 415

Leu Ser Ala Gly Ile Leu His Gly Thr Asp Ser Thr Ser Gly Asn Thr
420 425 430

Glu Gln Val Thr Tyr Asn Asp Gly Phe Ser Ala Ser Leu Trp Arg Asn
435 440 445

His Thr Glu Ser Asp Ala Cys Ser Gly Arg His Pro Gln Ser Val His
450 455 460

Ala Ser Met Thr Cys Gln Thr Ser Met Asn Ala Ser Leu Ser Val Ser
465 470 475 480

Val Gly Asn Trp Tyr Ala Leu Leu Gly Tyr Ser Thr Ser Arg Thr Glu
485 490 495

Gly Arg Pro Val Tyr Arg Gly Tyr Asp Asp Asn Ser Asp Lys Glu Asn
500 505 510

Val Phe Trp Arg Gln Ala Tyr Ile Pro Ala Ser His Arg Glu Ser Ala
515 520 525

Gln Ala Ser Ala Thr Tyr Ser Leu Asn Met Ala Gly Met Asn Ile Asn
530 535 540

Thr His Gly Gly Val Trp Arg Thr Arg Asn Asp Gly Val Asn Asp Asp
545 550 555 560

Gly Leu Phe Met Ser Val Ser Val Ser Tyr Ala Ser Gln Pro Pro Thr
565 570 575

Met Thr Gly Ser Asn Arg Tyr Thr Ser Ala Gly Thr Asp Ile His Ser
580 585 590

Ser Arg Asn Gln Lys Thr Gln Thr Ser Trp Asn Val Asn His Val Arg
595 600 605

Ser Trp Gln Gln Asp Leu Tyr Arg Glu Leu Ser Val Gly Phe Ser Gly
610 615 620

Tyr Asn Asp Asp Ser Trp Ser Gly Ser Leu Gly Gly Arg Met Ser Gly
625 630 635 640

Arg Met Gly Glu Leu Ser Ala Thr Ile Ser Asn Ser His Gln Arg Asn
645 650 655

Ala Gly Ser Ala Ser Ser Leu Thr Ala Gly Tyr Ser Ser Ser Leu Ala
660 665 670

Leu Ser Arg Asn Gly Leu Phe Trp Gly Gly Gly Gln Asp Gly Glu Pro
675 680 685

Ala Ser Gly Met Ala Val Asn Val Glu Ser Glu Gly Asp Glu Gly Ser
690 695 700

Ser Gly Lys Val Val Ser Val Arg Gly Ser Ser Gln Pro Phe Ser Leu
705 710 715 720

Gly Phe Gly Gln Gln Ser Leu Leu Leu Met Glu Gly Tyr Asn Ala Thr
725 730 735

Glu Val Thr Ile Glu Asp Ala Gly Val Ser Ser Gln Gly Met Ala Gly
740 745 750

Val Lys Ala Gly Gly Gly Ser Arg Cys Tyr Phe Leu Thr Pro Gly His
755 760 765






Leu Leu Val His Asn Ile Ser Ala Ser Met Ser Arg Leu Tyr Val Gly
770 775 780

Arg Val Leu Asp Lys Asp Gly Arg Pro Leu Leu Asp Ala Gln Pro Leu
785 790 795 800

Asn Tyr Pro Phe Leu Ser Leu Gly Pro Ser Gly Arg Phe Ser Leu Gln
805 810 815

Ser Glu His Lys Glu Ser Ser Leu Trp Leu Leu Ser Lys Asn Arg Ile
820 825 830

Leu Arg Cys Pro Met Ser Val His Lys Arg Arg Asp Val Met Gln Val
835 840 845

Val Gly Asp Val Arg Cys Glu Leu Ser Asp Val Asp Ala Leu Pro Gln
850 855 860

Ala Leu Gln Ile Ser Pro Arg Val Ile Arg Leu Leu Asn Val Ala Gly
865 870 875 880

Leu Leu Arg His Ser Val Gln Glu Ala
885



<210> 10 
<211> 359 
<212> PRT 

<213> Salmonella typhi 
<400> 10 

Met Ser Asn Lys Met Lys Trp Thr Ser Met Thr Ala His Trp Ser Ala 
1 5 10 15 

Ile Ile Asn Phe Ile Arg Lys Tyr Val Tyr Pro Ala Arg Ile Ile Ala 
20 25 30 

Ile Leu Leu Met Ala Gly Ala Thr Leu Pro Gln Val Ala Asp Ala Ile 
35 40 45 

Thr Val Asp Leu Asn Tyr Asp Lys Asn Asn Val Ala Val Ile Thr Pro 
50 55 60 

Val Trp Ser Gln Glu Trp Ser Val Ala Asn Val Leu Gly Gly Trp Val 
65 70 75 80 

Cys Arg Ser Asn Arg Asn Glu Asn Glu Gly Ala Cys Glu Glu Thr His 
85 90 95 

Leu Val Trp Trp Tyr Ala Phe Gly Ala Tyr Ser Lys Ile Arg Leu Arg 
100 105 110 

Phe Arg Glu Gln Ile Ser His Ala Glu Ile Thr Leu Ile Leu Leu Gly 
115 120 125 

Ser Val Arg Asp Ala Cys Tyr Thr Gly Val Ile Asn Met Asn Ala Ala 
130 135 140 

Ala Cys Gln Trp Gly Arg Ser Leu Lys Leu Arg Ile Pro Ser Glu Glu 
145 150 155 160 

Leu Ala Lys Ile Pro Thr Ser Gly Thr Trp Lys Ala Thr Leu Val Leu 
165 170 175 

Asp Tyr Leu Gln Trp Gly Gly Asp Asp Pro Leu Gly Thr Ser Thr Thr 
180 185 190 

Asp Ile Thr Leu Asn Val Thr Asp His Phe Ala Glu Asn Ala Ala Ile 
195 200 205 

Tyr Phe Pro Gln Phe Gly Thr Ala Thr Pro Arg Val Asp Leu Asn Leu 
210 215 220 

His Arg Met Asn Ala Ser Gln Met Ser Gly Arg Ala Asn Leu Asp Met 
225 230 235 240 

Cys Leu Tyr Asp Gly Gly Val Lys Ala Arg Ser Leu Gln Met Lys Ile 
245 250 255 

Glu Gly Ser Asn Lys Ser Gly Thr Gly Phe Gln Val Ile Lys Ser Asp 
260 265 270 

Ser Ala Asp Thr Ile Asp Tyr Ala Val Ser Met Asn Tyr Gly Gly Arg 
275 280 285 

Ser Ile Pro Val Thr Arg Gly Val Glu Phe Ser Leu Asp Asn Val Asp 
290 295 300 

Lys Ala Ala Thr Arg Pro Val Val Leu Pro Gly Gln Arg Gln Ala Val 
305 310 315 320 

Arg Cys Val Pro Val Pro Leu Thr Leu Thr Thr Gln Pro Phe Asn Ile 
325 330 335 

Arg Glu Lys Arg Ser Gly Glu Tyr Gln Gly Thr Leu Thr Val Thr Met 
340 345 350 

Leu Met Gly Thr Gln Thr Pro 
355 



<210> 11 
<211> 151 
<212> PRT 

<213> Salmonella typhi 
<400> 11 

Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu Lys Glu Ile 
1 5 10 15 

Gln Lys Asp Cys Ala Gln Thr Leu Trp Gln Leu Ala Tyr Lys Val Gly 
20 25 30 

Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu Lys Asp Arg 
35 40 45 

Gly Val Ile Ile Gly Gln Phe Ala Leu Leu Asp Lys Glu Lys Leu Gly 
50 55 60 

Leu Ser Leu Asn Val Phe Ile Met Ile Asn Ile Ser Glu Glu Gln Tyr 
65 70 75 80 

Ala Ser Ile Ser Glu Lys Ile Lys Ser Met Pro Glu Val Ile Ala Phe 
85 90 95 

Tyr Arg Ile Ser Gly Ser Phe Asn Tyr Leu Met His Thr Val Phe Thr 
100 105 110 

Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys Ile Ile Leu Thr Asn 
115 120 125 

Ser Ser Ile Ser Gly Ser Ala Ser Ser Phe Val Leu Glu Gln Ile Lys 
130 135 140 

Glu Thr Asn Glu Leu Ser Val 
145 150 

AMENDMENT 

In the Claims : 

Please amend claims 10 and 11 as follows. 

10. (Amended) A method for vaccinating a mammal against diseases 
caused by Salmonella enterica subspecies I, comprising administering a vaccine according 
to claim 4 to said mammal. 

11. (Amended) A method for vaccinating a mammal against diseases 
caused by Salmonella enterica subspecies I serovar Typhi, comprising administering a 
vaccine according to claim 5 to said mammal. 

Please add the following new claims. 

15. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I, comprising administering a vaccine according to claim 6 
to said mammal. 

16. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I, comprising administering a vaccine according to claim 8 
to said mammal. 

17. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising administering a vaccine 
according to claim 7 to said mammal. 

18. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising administering a vaccine 
according to claim 9 to said mammal. 

VERSION WITH MARKINGS TO SHOW CHANGES MADE 

In the Claims : 

Claims 10 and 11 have been amended as follows. 

10. (Amended) A method for [protection] vaccinating a mammal against 
diseases caused by Salmonella enterica subspecies I, comprising administering a vaccine 
according to [any of claims] claim 4[, 6, and 8] to said mammal. 

11. (Amended) A method for [protection] vaccinating a mammal against 
diseases caused by Salmonella enterica subspecies I serovar Typhi, comprising 
administering a vaccine according to [any of claims] claim 5[, 7, and 9] to said mammal. 

New claims 15-18 have been added. 

--15. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I, comprising administering a vaccine according to claim 6 
to said mammal. 

16. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I, comprising administering a vaccine according to claim 8 
to said mammal. 

17. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising administering a vaccine 
according to claim 7 to said mammal. 

18. (New) A method for vaccinating a mammal against diseases caused by 
Salmonella enterica subspecies I serovar Typhi, comprising administering a vaccine 
according to claim 9 to said mammal.-- 

REMARKS 

Introductory Comments : 

The present application is a 35 U.S.C. §371 filing of International Application 
Number PCT/SE00/01079, filed 26 May 2000. 

Overview of the Amendments : 

Applicants, by way of this preliminary amendment, have entered minor claim 
amendments and four new claims for consideration herein. Applicants respectfully request 
that these amendments be entered prior to calculation of the filing fees, and prior to 
examination of the present application on the merits. 

More particularly, applicants have amended claims 10 and 11 merely to remove 
multiple dependencies and to recite the invention with greater particularity. Support for 
these claim amendments can be found in the claims as originally filed and throughout the 
specification, for example, at page 6, line 15 through page 7, line 11. In addition, applicants 
have submitted herewith new claims 15-18. Support for these new claims can be found 
throughout the specification and claims, particularly in claims 10 and 11 as originally filed 
and in the specification at page 6, line 15 through page 7, line 11. Accordingly, applicants 
submit that no new matter has been added by way of the above-described claim 
amendments and the new claims, and the entry thereof is respectfully requested. 

The claim amendments and the new claims have been provided in both "clean 
version" and in "marked-up version" in conformance with 37 C.F.R. §1.121(c). The 
"marked-up version" of these amendments and new claims shows the changes made to the 
claims by the current amendment. The attached pages are captioned "VERSION WITH 
MARKINGS TO SHOW CHANGES MADE". 

CONCLUSION 



Applicants respectfully submit that the claims define an invention which complies 
with the requirements of 35 U.S.C. § 112 and which is novel and nonobvious over the art. 
Accordingly, allowance is believed to be in order and an early notification to that effect 
would be appreciated. 

Respectfully submitted, 
SBL Vaccin AB 

Date: 22 November 2001 

Thomas P. McCracken 
Registration No. 38,548 

Telephone: (510) 742-9700, ext. 209 
Fax: (510) 742-9720 

of the invention 

The present invention is based on the finding that two fimbrial operons, the saf 
operon and the tcf operon, are specific for Salmonella enterica subspecies I 
bacteria and therefore have therapeutic use. Due to their specificity they can be 
used to provide vaccines against Salmonella enterica subspecies I as well as for 
detection of Salmonella enterica subspecies I. The saf operon is specific for all 
Salmonella enterica subspecies I bacteria and the tcf operon is specific for the 
serovar Typhi of Salmonella enterica subspecies I. 

All or part of the DNA sequences of the genes encoding these proteins can be 
used as active agents in a vaccine against diseases caused by the Salmonella 
enterica subspecies I bacterial strains or for detection of said bacterial strains. 

The present invention also relates to methods of isolating these fimbrial 
proteins, to antibodies directed against these proteins, and to a vaccine 
composition comprising these proteins or antibodies directed against these 
proteins for use in the treatment of infections caused by Salmonella spp. 
The fimbrial proteins according to the invention or antibodies directed against 
them can be used for detection of Salmonella spp. bacteria. 

Background of the invention 

The members of the genus Salmonella colonize and infect a wide range of 
different organisms. Many cause gastroenteritis and enteric fever in humans 
and domesticated animals while others are not associated with human disease 
(Saylers et al, 1994). The genus has been divided into two species, Salmonella 
bongori and Salmonella enterica, where enterica can be further subdivided into 
seven subspecies, designated I, II, IIIa, IIIb, IV, VI, and VII (Reeves et al, 1989). 
Salmonella enterica subspecies I consists of over 1300 different serovars and is 
preferentially associated with warm-blooded animals (Baumler, 1997). Over 
99% of all clinical Salmonella isolates are strains belonging to this subspecies, 
including serovars Typhimurium and Enteritidis, which are the major causes of 
Salmonella induced gastroenteritis in humans, and Typhi, the human specific 
causative organism of typhoid fever, the most severe form of human 
salmonellosis (Popoff and Le Minor, 1992). 

Today gastroenteritis and enteric fever can neither be prevented nor treated 
with good results. Typhoid fever is a substantial public health problem in 
developing countries. Each year 33 million people become ill and over 500 000 
people die from this infection (American Institute of Medicine, 1986). Typhoid 
fever can be prevented by vaccination with attenuated bacteria, such as Ty21 
and Vi vaccines and whole cell vaccines. Whole cell vaccines show a high 
incidence of side effects (Ashcroft et al, 1964, Yugoslav Typhoid commission, 
1964). The vaccines consisting of attenuated strains of Salmonella typhi suffer 
from serious drawbacks. They must be administered as three or four spaced 
doses in order to stimulate protective immune responses (Levine et al, 1989). 
The treatment of Salmonella typhi with antibiotics is jeopardized since there 
are strains of Salmonella typhi that are resistant to chloramphenicol, 
ampicillin, and trimethoprim as well as ciprofloxacin (i.e. multidrug-resistant 
strains) (Rowe et al, 1997). 

Accurate detection of Salmonella enterica subspecies I is today not possible. 
Salmonella enterica subspecies I can today only be detected by antibodies 
directed against surface proteins of Salmonella enterica subspecies I. The use of 
the sequences according to the invention makes it for the first time possible to 
rapidly and accurately determine the presence of Salmonella enterica 
subspecies I. 

For many pathogenic bacteria, there is evidence that the filamentous surface 
protein structures called pili (fimbriae) are connected to the adhesion of the 
bacteria to the host cells. Pili proteins are very antigenic and are easily 
purified. Therefore pili preparations have been used as antigens for 
vaccination. 



Summary of the invention 

The invention relates to the objects as defined in the claims. The main object of 
the present invention is to provide two fimbrial proteins that are specific for 
Salmonella enterica subspecies I bacterial strains, the nucleotide sequences 
encoding said proteins, as well as the corresponding amino acid sequences, 
for therapeutic and diagnostic use. Further, recombinant microorganisms are 
provided, in which the nucleotide sequences according to the invention have 
been inserted. 

An object of the present invention is to provide vaccine compositions for use in 
the treatment of Salmonella enterica infective strains, essentially pure Saf and 
Tcf pili protein of Salmonella enterica subspecies I and Salmonella enterica 
subspecies I serovar Typhi, respectively, as well as antibodies directed to these 
pili proteins. 

A further object of the present invention is to provide the DNA sequences of the 
genes encoding the Saf and Tcf proteins. These sequences can be used for 
recombinant production of the proteins and for the preparation of vector 
vaccines against Salmonella enterica subspecies I and Salmonella enterica 
subspecies I serovar Typhi, respectively, as well as for diagnostic purposes. 

Yet another object of the present invention is to use purified Saf and Tcf protein 
from Salmonella enterica subspecies I bacteria for active or passive 
immunization of mammals, i.e. the proteins according to the invention can be 
comprised in a vaccine composition or be used to raise antibodies which can be 
comprised in a vaccine composition. 

Finally, an object of the present invention is to provide a method for preventing 
or reducing the possibility of Salmonella infection of a mammal by 
administering the vaccines according to the invention. The invention may be 
more fully understood by reference to the following drawings and detailed 
description. 

Brief description of the drawings 

Figure 1: Schematic representation of phage clones (named N10, D1, B1, F11) 
covering the entire cs7 insert of Salmonella enterica serovar Typhimurium strain 
SR-11 χ3181, i.e. comprising the saf fimbrial operon, i.e. safA, B, C and D 
(SEQ ID NO 1). 
The clones were selected from partial EcoRI and BamHI libraries in the 
Lambda Dash II vector. The cs7 insert is represented by a bold line. The extent 
of the respective phage inserts is represented by horizontal bars. The name and 
size of the phage inserts are indicated on the left side of the figure. 

Figure 2: Schematic representation of the pTY52 cosmid comprising the tcf 
operon (SEQ ID NO 2). 
A tcf specific PCR fragment of 11 105 bp was cloned into the Expand vector I 
cosmid (Roche). The insert is represented with a thick black line while vector 
sequences are represented with thin lines. Relevant restriction sites 
are indicated. The position of the tcf operon, i.e. tcfA, B, C and D (SEQ ID NO 
2), is represented by a shaded arrow. 

Figure 3: The phylogenetic distribution of the identified genes on the cs7 insert 
was investigated using the well defined SARC collection, see Example 1. 

Figure 4: A 2 kb large internal EcoRI fragment was used as a probe in a 
Southern blot of the SARC collection, see Example 2. 

Sequence listing 

Sequence Listing No. 1, herein referred to as SEQ ID NO 1: DNA sequence of 
the genes encoding the precursor of the saf fimbrial unit of Salmonella enterica 
subspecies I. 

Sequence Listing No. 2, herein referred to as SEQ ID NO 2: DNA sequence of 
the genes which encode the precursor of the tcf fimbrial unit of Salmonella 
enterica subspecies I serovar Typhi. 

Deposit information 

The phages carrying the inserted SEQ ID NO 1, i.e. phage clones B1, D1, F11 
and N10 (see Figure 1), have been given the ECACC Accession numbers 
99051922, 99051923, 99051924, and 99051925, respectively. 
The cosmid carrying the inserted SEQ ID NO 2, i.e. cosmid pTY52 (see Figure 
2), has been given the ECACC Accession number 99051926. 
The deposits were made on May 19, 1999. 

Detailed description of the invention 

The present invention is based on the finding that two fimbrial operons, the saf 
operon and the tcf operon, are specific for Salmonella enterica subspecies I 
bacteria. Due to their specificity they can be used to provide vaccines against 
Salmonella enterica subspecies I as well as detection methods for Salmonella 
enterica subspecies I. The saf operon is specific for all Salmonella enterica 
subspecies I bacteria and the tcf operon is specific for the serovar Typhi of 
Salmonella enterica subspecies I, see Examples 1 & 2. 

The main object of the invention relates to two fimbrial operons, the saf operon 
and the tcf operon, that are specific for Salmonella enterica subspecies I 
bacteria, for therapeutic use. 

Another object of the present invention is to provide vaccines against 
Salmonella enterica subspecies I induced gastroenteritis, enteric fever and 
typhoid fever. 

A further object of the present invention is to provide methods to detect 
Salmonella enterica subspecies I. The nucleotide sequences according to the 
invention are useful for constructing vectors for use as vaccines: for insertion 
into attenuated bacteria in constructing a recombinant vaccine, for insertion 
into a viral vector in constructing a recombinant viral vaccine, or for direct 
inoculation as a nucleic acid vaccine. The pili proteins according to the 
invention, or antigenic fragments thereof, can be used for active immunization 
and antibodies directed against them can be used for passive immunization. All 
these applications of the sequences according to the invention are obtained by 
applying standard techniques known to a person of ordinary skill in the art. 

Vaccines against Salmonella enterica subspecies I 

The genes encoding the saf and tcf fimbrial structures, or fragments thereof, 
may be incorporated into a bacterial or viral vaccine comprising recombinant 
bacteria, virus or fungi which are engineered to produce one or more 
immunogenic epitopes of the saf or tcf fimbrial structures. In addition, the 
genes encoding the saf and tcf fimbrial structures, or part thereof, operatively 
linked to regulatory elements, can be introduced directly as a nucleic acid 
vaccine, to elicit a protective immune response. 

The proteins, or antigenic fragments thereof, deduced from the nucleic acid 
sequences of the present invention are useful alone or in conventional vaccine 
mixtures in the vaccine compositions according to the invention. The proteins 
could be produced by chemical synthesis or recombinant expression according 
to conventional methods. 

The proteins and peptides according to the invention can be obtained by using 
a host organism transformed or transfected with an expression vector obtained 
by insertion of a gene according to the invention, or part thereof, into a vector 
in a conventional manner. The vector which is used to construct the expression 
vector is not particularly limited, but specific examples include plasmids such 
as pET (Stratagen) and the like; and phages such as M13 (NEB), phage display 
libraries and the like. As expression regulatory sequences, T7 promoters and 
lac promoters, among others, can be used. 

An appropriate host to be transformed or transfected with the expression vector 
can be chosen among, for example, E. coli, Salmonella or Bacillus subtilis. The 
transformed or transfected host is cultured and proliferated under suitable 
conditions. 

After culturing, the peptides of the present invention may be purified by, for 
example, chromatography, precipitation, and/or density gradient 
centrifugation. The thus obtained peptides can be used as a vaccine or for the 
production of antibodies directed against said peptides, which can be used for 
passive immunization. 

The purified preparation containing one or several proteins according to the 
invention, or parts thereof, is then formulated as a pharmaceutical 
composition, as for example a vaccine, or in a mixture with adjuvants. If 
desired the proteins are fragmented by standard chemical or enzymatic 
techniques to produce antigenic segments. 

In formulating the vaccine compositions with the peptide or protein, alone or in 
various combinations, the immunogen is adjusted to an appropriate 
concentration and formulated with any suitable vaccine adjuvant. The 
immunogen may also be incorporated into liposomes, or conjugated to 
polysaccharides and/or other polymers for use in a vaccine formulation. 

The different vaccines according to the present invention can be administered to 
mammals in many different ways. These include intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, oral, and intranasal routes of 
administration. The vaccine doses will differ depending on circumstances such 
as body weight, interference with other administered medicaments etc. The 
upper limit is not critical unless the dose shows toxicity. 

The peptides and proteins of the present invention are also useful to produce 
monoclonal or polyclonal antibodies for use in passive immunotherapy against 
Salmonella enterica subspecies I. Human immunoglobulin is preferred. 
Antiserum is obtained from individuals immunized with proteins or peptides 
according to the invention. The immunoglobulin fraction is then enriched, for 
example by immunoaffinity or affinity chromatography. Antibodies raised in 
a suitable mammal or in the patient to be treated can subsequently be 
administered locally or topically, e.g. orally, to the patient. 

Detection of Salmonella enterica subspecies I in general. 

The sequences according to the invention, or part thereof, or fragments 
hybridizing therewith, as well as the proteins according to the invention, or part 
thereof, and antibodies directed to said proteins, or antigenic fragments 
thereof, can be used in molecular diagnostic assays for the detection of 
Salmonella enterica subspecies I. 

Nucleic acids having the nucleotide sequence according to the invention, or any 
nucleotide sequence hybridizing therewith, can be used as a probe in nucleic 
acid hybridization assays for the detection of Salmonella spp. in various tissues 
and body fluids of patients. The hybridization assay may be of any type, 
including Southern blots, Northern blots and colony blots. 

PCR technology is the most preferred technology for detection according to the 
invention of Salmonella enterica subspecies I. At least one primer selected 
from the 5' end and one from the 3' end can be used in PCR and other known 
tests to rapidly identify the presence of Salmonella enterica subspecies I. This is 
done according to conventional techniques. 
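
The primer-based detection described above can be illustrated with a minimal in-silico sketch. It assumes exact primer matches on a single template strand, and the template and primers shown are hypothetical toy sequences, not taken from SEQ ID NO 1 or SEQ ID NO 2; the function names are likewise illustrative only.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def amplicon_length(template: str, forward: str, reverse: str):
    """Return the expected product length, or None if a primer has no exact match."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so its reverse
    # complement is what appears on the template strand.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return end + len(site) - start


# Hypothetical toy data for illustration only.
toy_template = "TTGACCATGGCTAGCAAAGGTTCCGATTACGGATCCAA"
toy_forward = "ACCATGGCTAGC"
toy_reverse = reverse_complement("GATTACGGATCC")

print(amplicon_length(toy_template, toy_forward, toy_reverse))
```

A returned length indicates that both primer sites are present in the expected orientation; `None` indicates that no product would be expected from this template under the exact-match assumption.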

The isolated and purified proteins and peptides of the invention can be used as 
diagnostics to measure an increase in serum titer of Salmonella enterica 
subspecies I-specific antibody since they bind strongly to these antibodies. A 
serum test sample can be screened for Salmonella enterica subspecies I by 
methods such as, for example, ELISA. 

The invention further comprises the use of antibodies directed against the saf 
and tcf fimbrial structures for quantitative or qualitative determinations of the 
pili proteins of the invention, or fractions thereof, in cells, tissues or body 
fluids. 

Detection of Salmonella enterica subspecies I by using nucleic acid 
hybridization technology 

Nucleic acid hybridization technology can also be used to detect Salmonella enterica 
subspecies I according to the invention. The nucleic acid probes chosen from 
parts of the sequences according to the invention can be either DNA or RNA. 
DNA sequences complementary to the sequences according to the invention 
can also be used. The binding of the probe to the target sequence, i.e. the 
hybridization, need not be perfect. Variations and mutations of the sequences 
according to the invention can be used as long as they hybridize well enough 
to detect Salmonella enterica subspecies I. The preferred length of the nucleic 
acid probes is about 10 to 400 nucleotides, most preferably not longer than 100 
nucleotides. 
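
The idea that a probe may still be useful when binding is imperfect can be sketched as a simple mismatch count. This is only an illustration under stated assumptions: the probe is compared directly with the strand it is identical to (the complement of the strand it would anneal to), gaps are ignored, and the sequences are hypothetical toy data. Real hybridization behaviour also depends on temperature, salt and probe composition, which this sketch does not model.

```python
def best_hybridization_site(target: str, probe: str):
    """Return (position, mismatches) for the target window that best matches the probe."""
    target, probe = target.upper(), probe.upper()
    if len(probe) > len(target):
        raise ValueError("probe is longer than target")
    best = None
    for pos in range(len(target) - len(probe) + 1):
        window = target[pos:pos + len(probe)]
        mismatches = sum(1 for a, b in zip(window, probe) if a != b)
        # Keep the first window with the fewest mismatches.
        if best is None or mismatches < best[1]:
            best = (pos, mismatches)
    return best


# Hypothetical toy data: the probe differs from the target at one position.
toy_target = "AAGCTTGACCGTAGGCTAAC"
toy_probe = "GACCGTTGGC"
print(best_hybridization_site(toy_target, toy_probe))
```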

The nucleotide probe is preferably chosen from the parts of the sequences that 
have the least variation. In the most preferred embodiments when screening for 
SEQ ID NO 1 (the saf operon, specific for Salmonella enterica subspecies I) a 
nucleotide probe or PCR primer selected from nucleotides 37 368-37 868 
should be avoided since this region is hypervariable. 
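
As a hedged illustration, the selection rule above can be written as a coordinate check. The sketch reads the region in the text as nucleotides 37368 to 37868, uses the preferred probe length of about 10 to 400 nucleotides stated earlier, and assumes 1-based inclusive coordinates; the function names are hypothetical.

```python
# Hypervariable region as given in the text (assumed 1-based, inclusive).
HYPERVARIABLE = (37368, 37868)


def overlaps(start: int, end: int, region=HYPERVARIABLE) -> bool:
    """True if the inclusive interval [start, end] overlaps the region."""
    return start <= region[1] and end >= region[0]


def acceptable_probe(start: int, end: int, min_len: int = 10, max_len: int = 400) -> bool:
    """True if the probe has a preferred length and avoids the hypervariable region."""
    length = end - start + 1
    return min_len <= length <= max_len and not overlaps(start, end)


print(acceptable_probe(37000, 37099))  # 100 nt, upstream of the region
print(acceptable_probe(37300, 37399))  # 100 nt, runs into the region
```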

The nucleic acid probes according to the invention are prepared by any 
conventional method such as organic synthesis, recombinant techniques, or 
isolation from genomic DNA. 

The nucleic acid probes of the invention are labeled in a conventional manner 
to signal hybridization to target nucleic acid from Salmonella enterica 
subspecies I. The labeling may comprise a radiolabel, an enzyme, a bacterial 
label, a fluorescent label, an antibody, an antigen, a latex particle, an electron 
dense compound, or a light scattering particle. 

The probes may be provided in a lyophilized form, to be reconstituted in a 
buffer appropriate for hybridization, or the probes may already be present in 
such a buffer. The buffer may contain a suitable hybridization enhancer, 
detergent, carrier DNA, and a compound to increase the specificity. 

Any conventional hybridization assay technique, such as dot blot hybridization, 
Southern blotting, sandwich hybridization, displacement hybridization and the 
like, can be used. 

The target analyte polynucleotide of a microorganism may be in various media, 
most often in a biological, or physiological specimen. In most cases it is 
preferred to subject the specimen containing the target polynucleotide to any 
conventional extraction, purification, and/or isolation before conducting the 
analysis. 

The sample containing the target analyte nucleotide sequence must often be 
treated to convert the DNA to a single-stranded form, which may be 
accomplished by a variety of conventional techniques, such as thermal or 
chemical techniques. 

The following examples describe the isolation and specificity of the sequences 
according to the invention. 

EXAMPLE 1 

Identification and characterization of the saf operon. 

The present inventors found, upon investigation of a 7 kb chromosomal region 
on centisome 7 originally isolated from the S. typhimurium strain SR-11 χ3181, 
a region that exhibits many of the traits that define a pathogenicity island. It 
has a lower G+C composition than the average composition of the Salmonella 
genome and includes many sequences related to different mobile genetic 
elements. The region is not present in E. coli K12, and the Salmonella specific 
DNA is inserted between the tRNA gene aspV and the stop codon of yafV, a 
hypothetical protein upstream of the yafH gene at 5 min in the E. coli 
chromosome. This Salmonella specific insert encodes proteins creating adhesive 
structures and other virulence factors. Sequencing revealed genes encoding a 
new fimbrial operon that they designated Salmonella Atypical Fimbriae (saf), 
due to its relatedness to a subgroup of adhesive structures forming thin 
atypical fimbriae or non-fimbrial adhesins. 

The saf operon consists of four contiguous genes, safA, safB, safC and safD, 
that encode fimbrial subunit, periplasmic chaperone, outer membrane usher 
protein and alternative fimbrial subunit, respectively. The genes safA, B, C and 
D encode putative proteins of 166, 244, 836 and 156 amino acids, 
respectively. Analyses of clinical Salmonella isolates showed that DNA of 195 
out of 198 clinical isolates belonging to S. enterica subspecies I hybridized with 
safB and safC, i.e. these sequences are common to more than 98% of the 
Salmonella enterica subspecies I isolates tested. The inventors showed that 
58% of these clinical isolates carry safA, see Table 1. 

Table 1. The prevalence of the saf genes in clinical Salmonella isolates. 

Serovar    safA    safB    safC    # isolates 

S. adelaide 
S. agona 
S. anatum 
S. bareilly 
S. blockley 
S. bovismorbificans 
S. braenderup 
S. brandenburg 
S. bredeney 
S. chester 
S. colindale 
S. derby 
S. dublin 
S. eastbourne 
S. emek 
S. enteritidis 
S. give 
S. goettingen 
S. haardt 
S. hadar 
S. heidelberg 
S. hvittingfoss 
S. infantis 
S. java 
S. javiana 
S. kottbus 
S. livingstone 
S. london 
S. maastricht 
S. mbandaka 
S. montevideo 
S. muenster 
S. newport 
S. ohio 
S. oranienburg 
S. panama 
S. potsdam 
S. rissen 
S. saarbrucken 
S. saint paul 
S. schwarzengrund 
S. singapore 
S. stanley 
S. subsp I 4.5,12:-:- 
S. subsp I 4.5,12:b:- 
S. subsp I 4.5,12:i:- 
S. subsp I spont 
S. tennessee 
S. thompson 
S. typhi 
S. typhimurium 
S. virchow 
S. weltevreden 
S. worthington 
S. subsp III 

The phylogenetic distribution of the identified genes on the cs7 insert was 
investigated using the well defined SARC collection, which showed that the 
presence of the safA, safB, safC and safD genes is restricted to S. enterica 
subspecies I (Fig. 3). This region is hence the first subspecies I specific genetic 
region to be identified with a broad distribution within the subspecies. Since 
the serovars of subspecies I account for over 99% of human salmonellosis and 
are preferentially associated with warm-blooded animals, this implicates a role for 
the saf adhesive organelle in the colonization of these organisms. 

EXAMPLE 2 

Identification and characterization of the tcf operon. 

The present inventors found that Salmonella enterica subspecies I serovar Typhi 
contains DNA encoding an additional fimbrial operon, the tcf operon, in the 
sinR-pagN intergenic region. Southern blot analysis revealed a markedly 
different restriction pattern in S. enterica serovar Typhi than in the other 
subspecies I isolates, suggesting that the saf-sin region in serovar Typhi might 
carry additional DNA relative to serovar Typhimurium strains. A PCR reaction 
(using a kit from Roche) was therefore performed using a sinR (5'-GTA AAT 
CGC TTA GTC GCC-3') specific forward primer and a pagN (5'-TCA ACT CAA 
CCT TCA GCC-3') specific reverse primer. 

This primer pair produced, as expected, a product of 2 kb in serovar 
Typhimurium from the SARC collection, while from serovar Typhi the product 
was 10 kb. Thus, the neighboring sinR and pagN genes in serovar 
Typhimurium strains are separated by approximately 8 kb in serovar Typhi. 
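
The primer pair and product sizes above can be sanity-checked with a short sketch. The primer sequences and the 2 kb and 10 kb product sizes are those given in the text; the melting-temperature estimate uses the Wallace rule of thumb (2 °C per A/T, 4 °C per G/C), which is an assumption of this sketch and not a method stated in the source.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C for a short oligonucleotide."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


# Primer sequences as given in the text, written without spaces.
primers = {
    "sinR forward": "GTAAATCGCTTAGTCGCC",
    "pagN reverse": "TCAACTCAACCTTCAGCC",
}

for name, seq in primers.items():
    print(name, len(seq), "nt", f"GC={gc_fraction(seq):.2f}", f"Tm~{wallace_tm(seq)} C")

# Approximate extra DNA in serovar Typhi: product size difference in kb.
typhimurium_kb, typhi_kb = 2, 10
print("additional DNA in Typhi:", typhi_kb - typhimurium_kb, "kb")
```

Similar estimated melting temperatures for the two primers are generally desirable so that both anneal under the same cycling conditions.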

The Typhi-specific PCR product was purified, partially digested with EcoRI and
sub-cloned into pUC18, forming a set of overlapping clones. Sequencing of the
clones revealed a putative fimbrial operon designated tcf for Typhi Colonizing
Factor. Four ORFs, tcfA, tcfB, tcfC and tcfD, have been identified with putative
proteins having significant homology to CooB (38% identical over 192 aa), CooA (37%
identical over 170 aa), CooC (34% identical over 872 aa) and CooD (31%
identical over 272 aa), respectively. The Coo proteins are involved in the
biosynthesis of the CS1 colonizing factor antigens of enterotoxigenic E. coli (Fig.
4) (Froehlich et al., 1994). The peptide of the tcfB ORF is also homologous to
the CblA major fimbrial subunit protein (45% identical over 154 aa) of the cable
type II pili of the cystic fibrosis-associated Burkholderia cepacia (Sajjan et al.,
1995). Downstream of the tcf operon two ORFs were identified with the same
transcriptional orientation as the tcf genes. The first was designated tinR for
Typhi insert regulator because it is homologous (33% identical over 144 aa) to
AzlB of Bacillus subtilis, a member of the Lrp/AsnC family of transcriptional
regulators (Belitsky et al., 1997). tinR is followed by an ORF (tioA for Typhi
insert orf) encoding a putative protein of 205 amino acids with no significant
homology to any entry in the DDBJ/EMBL/GenBank databases. The above
sequence from Salmonella enterica serovar Typhi strain RKS 3333 and the tcf
region of the incomplete genome sequence from serovar Typhi strain CT18
(http://www.sanger.ac.uk) are 99% identical over the total length of the
investigated region, in concordance with the clonal nature of the serovar.
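The homology figures quoted above have the form "N% identical over L aa". The following Python sketch shows one common way such a figure is computed from a pairwise alignment, and what the reported percentages imply in residue counts. It rests on assumptions: the source does not name the alignment program or say whether gap columns count toward the length, the toy alignment is invented, and the function names are hypothetical. Here identity is taken as identical columns divided by total alignment columns.

```python
# Illustrative only: the toy alignment and helper names are assumptions,
# not the method used in the source. Percentages and lengths are from the text.
REPORTED = {
    "tcfA product vs CooB": (38, 192),
    "tcfB product vs CooA": (37, 170),
    "tcfC product vs CooC": (34, 872),
    "tcfD product vs CooD": (31, 272),
    "tcfB product vs CblA": (45, 154),
    "tinR product vs AzlB": (33, 144),
}


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * identical / len(aligned_a)


def implied_identical_residues(percent: int, length: int) -> int:
    """Approximate residue count behind a rounded 'percent over length' figure."""
    return round(percent * length / 100)


if __name__ == "__main__":
    # Toy 10-column alignment: 7 matching columns.
    print(percent_identity("MKTAYIAK-R", "MKSAYLAKQR"))
    for pair, (pct, length) in REPORTED.items():
        print(pair, implied_identical_residues(pct, length))
```

Because the published percentages are already rounded, the residue counts derived this way are approximate to within a residue or two.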

A 2 kb internal EcoRI fragment was used as a probe in a Southern blot of
the SARC collection. This blot shows that Salmonella enterica subspecies I
serovar Typhi (SARC2) is the only strain in the collection possessing DNA
hybridizing to this fragment (Fig. 4).
Claims:



1. Peptide encoded by a nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2) for use in medicine. 

2. Antibodies directed against a peptide encoded by a nucleotide sequence selected
from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ
ID NO 2) for use in medicine.



3. Nucleotide sequence selected from Sequence Listing No. 1 (SEQ ID NO 1) and 
Sequence Listing No. 2 (SEQ ID NO 2) for use in medicine. 

4. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I, comprising a peptide encoded by a nucleotide sequence selected from 
Sequence Listing No. 1 (SEQ ID NO 1), or antibodies directed against said peptide 
and, optionally, a pharmaceutically acceptable carrier. 

5. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I serovar Typhi, comprising a peptide encoded by a nucleotide sequence 
selected from Sequence Listing No. 2 (SEQ ID NO 2), or antibodies directed against 
said peptide and, optionally, a pharmaceutically acceptable carrier. 

6. A nucleic acid vaccine for the protection against diseases caused by Salmonella
enterica subspecies I, comprising a nucleic acid sequence selected from Sequence
Listing No. 1 (SEQ ID NO 1) and, optionally, a pharmaceutically acceptable
carrier.



7. A nucleic acid vaccine for the protection against diseases caused by Salmonella
enterica subspecies I serovar Typhi, comprising a nucleic acid sequence selected
from Sequence Listing No. 2 (SEQ ID NO 2) and, optionally, a pharmaceutically
acceptable carrier.

8. A vector vaccine for the protection against diseases caused by Salmonella enterica
subspecies I, comprising a host in which a recombinant vector comprising a
nucleic acid sequence selected from Sequence Listing No. 1 (SEQ ID NO 1), has
been inserted and, optionally, a pharmaceutically acceptable carrier.

9. A vector vaccine for the protection against diseases caused by Salmonella enterica
subspecies I serovar Typhi, comprising a host in which a recombinant vector
comprising a nucleic acid sequence selected from Sequence Listing No. 2 (SEQ ID
NO 2), has been inserted and, optionally, a pharmaceutically acceptable carrier.



10. A method for protection against diseases caused by Salmonella enterica subspecies
I, comprising administering a vaccine according to any of claims 4, 6, and
8.

11. A method for protection against diseases caused by Salmonella enterica subspecies
I serovar Typhi, comprising administering a vaccine according to any of claims
5, 7, and 9.

12. Antibodies directed against a peptide encoded by a nucleotide sequence selected
from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ
ID NO 2), for use in a diagnostic method.

13. Peptide encoded by a nucleotide sequence selected from Sequence Listing No.
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2), for use in a diagnostic
method.

14. Primers for, or probes that hybridize with, a nucleotide sequence selected
from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO
2), for use in a diagnostic method for the purpose of detecting Salmonella enterica
subspecies I.
ABSTRACT



The present invention is based on the finding that two fimbrial structures are
specific for Salmonella enterica subspecies I bacteria. Due to their specificity
they can be used to provide vaccines against Salmonella enterica subspecies I
as well as for detection of Salmonella enterica subspecies I.
COMBINED DECLARATION AND POWER OF ATTORNEY
FOR UTILITY PATENT APPLICATION



AS A BELOW-NAMED INVENTOR, I HEREBY DECLARE THAT:

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original,
first and joint inventor (if more than one name is listed below) of the subject matter which is claimed
and for which a patent is sought on the invention entitled: FIMBRIAL PROTEINS the specification
of which

X was filed on 26 November 2001 and assigned Serial No. 09/926,596; and

X was initially filed in a PCT application designating the US on 26 May 2000 and
assigned Serial No. PCT/SE00/01079;

X was amended under PCT Article 19 (35 U.S.C. §371(c)(3)) on 4 September
2001 and on 13 September 2001; and

X was amended by the Preliminary Amendment dated 22 November 2001.

I HAVE REVIEWED AND UNDERSTAND THE CONTENTS OF THE ABOVE- 
IDENTIFIED SPECIFICATION, INCLUDING THE CLAIMS, AS AMENDED BY ANY 
AMENDMENT REFERRED TO ABOVE. 

I acknowledge and understand that I am an individual who has a duty to disclose information which 
is material to the patentability of the claims of this application in accordance with Title 37, Code of 
Federal Regulations, §§ 1.56(a) and (b) which state:

(a) A patent by its very nature is affected with a public interest. The public interest 
is best served, and the most effective patent examination occurs when, at the time
an application is being examined, the Office is aware of and evaluates the teachings 
of all information material to patentability. Each individual associated with the filing 
and prosecution of a patent application has a duty of candor and good faith in 
dealing with the Office, which includes a duty to disclose to the Office all 
information known to that individual to be material to patentability as defined in this 
section. The duty to disclose information exists with respect to each pending claim 
until the claim is canceled or withdrawn from consideration, or the application 
becomes abandoned. Information material to the patentability of a claim that is 
canceled or withdrawn from consideration need not be submitted if the information 
is not material to the patentability of any claim remaining under consideration in the 
application. There is no duty to submit information which is not material to the 
patentability of any existing claim. The duty to disclose all information known to be 
material to patentability is deemed to be satisfied if all information known to be 
material to patentability of any claim issued in a patent was cited by the Office or 
submitted to the Office in the manner prescribed by §§ 1.97(b)-(d) and 1.98.
However, no patent will be granted on an application in connection with which 
fraud on the Office was practiced or attempted or the duty of disclosure was
violated through bad faith or intentional misconduct. The Office encourages
applicants to carefully examine: 

(1) prior art cited in search reports of a foreign patent office in a counterpart 
application, and 

(2) the closest information over which individuals associated with the filing 
or prosecution of a patent application believe any pending claim patentably defines,
to make sure that any material information contained therein is disclosed to the 
Office. 



(b) Under this section, information is material to patentability when it is not 
cumulative to information already of record or being made of record in the 
application, and 

(1) It establishes, by itself or in combination with other information, a prima 
facie case of unpatentability of a claim; or

(2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability relied on by the Office,

or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a 
conclusion that a claim is unpatentable under the preponderance of evidence, 
burden-of-proof standard, giving each term in the claim its broadest reasonable 
construction consistent with the specification, and before any consideration is given 
to evidence which may be submitted in an attempt to establish a contrary conclusion 
of patentability. 

I do not know and do not believe this invention was ever known or used in the United States of 
America before my or our invention thereof, or patented or described in any printed publication in 
any country before my or our invention thereof or more than one year prior to said application. This 
invention was not in public use or on sale in the United States of America more than one year prior 
to this application. This invention has not been patented or made the subject of an inventor's 
certificate issued before the date of this application in any country foreign to the United States of 
America on any application filed by me or my legal representatives or assigns more than six months
prior to this application. 

I hereby appoint the following attorneys and agents to prosecute that application and to transact all 
business in the Patent and Trademark Office connected therewith and to file, to prosecute and to
transact all business in connection with all patent applications directed to the invention: 
Thomas P. McCracken, Reg. No. [illegible]
Roberta Robins, Reg. No. [illegible]
Sharon E. Crane, Reg. No. 36,113



Address all correspondence to: Thomas P. McCracken

[illegible] PHARMACEUTICALS PLC
[illegible] House
The Oxford Science Park
Oxford [illegible]
United Kingdom
Telephone: +44 1865 332 600 
Facsimile: +44 1865 332 601 

Address all telephone calls to: Thomas P. McCracken in the United Kingdom at +44 1865 332 
600. 

This appointment, including the right to delegate this appointment, shall also apply to the same extent 
to any proceedings established by the Patent Cooperation Treaty. 

I hereby declare that all statements made herein of my own knowledge are true and that all
statements made on information and belief are believed to be true; and further that these statements
were made with the knowledge that willful false statements and the like so made are punishable by
fine or imprisonment, or both, under § 1001 of Title 18 of the United States Code and that such
willful false statements may jeopardize the validity of the application or any patent issued thereon.



Signature: [signed] Date: 2002 [remainder illegible]

Full Name of Inventor: Anders FOLKESSON
Citizenship: SE

Residence: Sweden
Post Office Address: Rasundavagen 169B, S-169 36 Solna, Sweden



Signature: [signed] Date: [illegible]

Full Name of Inventor: Staffan NORMARK
Citizenship: SE
Residence: Sweden

Post Office Address: Langangsvagen 27, S-182 75 Stocksund, Sweden





Signature: [signed] Date: [illegible]

Full Name of Inventor: Sven LÖFDAHL
Citizenship: SE
Residence: Sweden

Post Office Address: Adolfsbergsvagen 36, S-168 66 Bromma, Sweden
SEQUENCE LISTING NO. 1

<110> Folkesson, Anders 

<120> The complete sequence of the cs7 insert in Salmonella
enterica serovar Typhimurium

<130> Complete sequence of the cs7 insert 

<140> 
<141> 

<160> 5 

<170> PatentIn Ver. 2.1

<210> 1
<211> 46870
<212> DNA
<213> Salmonella typhimurium

<220>
<221> CDS
<222> (37368)..(37868)
<223> safA putative fimbrial subunit

<220>
<221> CDS
<222> (37952)..(38689)
<223> safB putative periplasmic chaperone

<220>
<221> CDS
<222> (38713)..(41223)
<223> safC putative outer membrane usher

<220>
<221> CDS
<222> (41245)..(41715)
<223> safD putative fimbrial subunit
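As a consistency check on the four CDS features listed above, the following Python sketch computes each feature's length from its coordinates and confirms that it is a whole number of codons. The coordinates are taken from the feature table. The assumptions, which the listing does not state, are that the ranges are 1-based and inclusive and that each range includes the stop codon; the names in the code are hypothetical.

```python
# Coordinates are copied from the <222> entries of the feature table.
# Assumed (not stated in the listing): 1-based inclusive ranges that
# include the stop codon. All names below are illustrative.
CDS_FEATURES = {
    "safA": (37368, 37868),
    "safB": (37952, 38689),
    "safC": (38713, 41223),
    "safD": (41245, 41715),
}


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive coordinate range."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Number of complete codons; raises if the range is not a multiple of 3."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


if __name__ == "__main__":
    for gene, (start, end) in CDS_FEATURES.items():
        codons = codon_count(start, end)
        # Subtract one codon for the stop to estimate the protein length.
        print(gene, cds_length(start, end), "nt,", codons - 1, "aa (estimated)")
```

Under these assumptions all four ranges divide evenly by three, so the coordinates are internally consistent as open reading frames.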



<400> 1 
gatacaaatc tcagggtctt tttatacatc ctgtgaagta aaaaaaaccg tatcactgta 60
aaagggatac ggtttttttt cgtcttcaag aagttccacc gtctatcgtg gaatctggcg 120
caaatgggee tacgcctgga tgacgaacag gatattaccg ccacttcttt cactgtcatg 180
gctattttga tcccactgac atttaaggcg cggcctcatg gcggtgctta acegggateg 240
ggacatgttc agegcagaag cagactgegt aatgttgata tcactcagat aattaeggag 300
aaccgccaga catgcgcatc atcactccag ggcatcccac ttctccagca actccaccgg 360
gatctcattg atcacctccg agaaccgttt tcccaccagt ctttcagcct ggcgtaacag 420
tgggatggtc gggctactgg gttcactget ctcaaaccag egacgaatge accgcaggcg 480
ttccagcgca tcattacgat cgcgaattgg ccccggttca gcattgtgtg ggagggatat 540
accgggtggc gcgttgacag gcggcacatc ctctgcccgt geeggaaegg cattatccat 600
aaccgcgcct gcggggtctg gcactggcgc cggcgacgga gggatttccg gcgrggttgt 660
ctgtaccacg tcaggcagca gcgctaacaa ctgccgcagg cgtgaaaaat ctggagccag 720 
atcgcctaat gtctcacgcg cccacacttg caggcgttct gcgcittcct gcgcctcgcg 780 
aaacgccgcc agcggnaatg ccccacgggc ttcgaggtcg gccagctgct ggcggacgga 840 
crcgggggcc agcgcaccag ccgggcgggg agcgganaac gcgcgctcca catcccgtac 900 
ccgcaggcgc agggcggcac cgtttgacag cgtaataccg cgaatgtccg ccatcacgcc 960 
ttcgtgatcc agcagcgccg ccagggcatt actgcgcgcc agcgacgcat cttcaataga 1020 
tt . ct . gC g gat gcgccttcgc cggtaagcag ctgtggatgg agtgcatcag accagatcac 1080 
gctcagttct gccagtcgtg tgagcatttc cgccagtccc tgtgcgccag cctgccggat 1140 
gcgactgcgc agcagcagga tcaacacccg gatatcctta ctgcgcgnga gcagacggcg 1200 
cgcarcacgc tcaatttccg gccagtrcac ggctnccggc gtactgacaa aancaccata 1260 
trgcgcctcc gcctgggggg cggcgcggct gaacagcagc aggtattccg gatcgtactc 1320 
cggatcgggg ccgcagggtt; gtcccgcgct taccggttta gtcagggaca tgtccatatc 1380 
atcaccctca gtgggtcaag ccgtgttcag gttcaaaaat catgccggtc accggacggt 1440 
cacgcacggc tttacccgcc caggccgtcc agccaagccg ctgcgcgccg ccgatcaccg 1500 
caggcggtgc ggcatgaggc tccaactgca gctcaatttc ccagcaatat tcaaaaccga 1560 
tgaacgtacg caccagttct gtcagcacgg gcagattgtt gccgccgggc agaaaacgca 1620 
ggtaatcctc cagcgtgagc gggccgataa ttagccggaa tttgtactgc atatccggta 1680 
cggccrggcc gataagcgcc ccgttgccca gcaccgaaga cngcgcgggc g-gcccagac 1740 
gggcgacctc atcgcccgcc accgttaccc agcgcagggc gaattcrtcc accgcaaaag 1800 
gtacgctgaa atiagtgcgcc agcgtggcgg ccagcccgcc aggatcgcgc gatccgcgta 1860 
ccagatgggc ggaggctgcc aggcgaacat gatctgacag cgggcttncg gcgctttccc 1920 
gtagatcctg cccgctgaga ctggcgatat aaaacgcaaa acggtcgtgt cccggtttgt: 1980 
ccagcccgcc accagcggac tgggcgctgc gccatgcctg ccagaactgc gccagccagc 2040 
ggcggtgaaa aatattggaa aaatgaacca gcgtcggatc gtgacgactc tctgagcggt 2100 
tcagtgccag ctcggtatag tgcagcggca acgggccgtt tggcccccat agrccgaggc 2160 
tgcacaggct caggtgcagg cgtccatcct gccagccgac ctgggcgatt ccccgtggcg 2220 
caaaggtcat cgtcggcgtc tgtcccagac ggaatttttc catccgtggc ttgccagata 2280 
cttgcccgcc ggagcatcac agagctgggc atccacgcgc cgcatcaggt tcaggaatcc 2340 
gcaacgccag ggggtttcct gcgcctiggnt gagagcgccg gtcatacgct cccccgttgc 2400 
ccggttctga ccggccaggt cacgacatgc ccgcgttgca tcgagtgcag cgtcatctgc 2460
gagaaggtat taatggaaac atggcgggca atacagtgtt ccagcaccag accgaacagg 2520 
taaggactga taccggaaaa tccttcttcg tccacggtca gtccgcaact gacgccccgg 2580 
ccatagacca acaggccgga gccgggcagg cggcgggtca ccggggtggt tttgcagcca 2640
atcaggctgc gcacctggcg cgactgcggg ccgtcgtgag ccgggacaaa cagattcagc 2700 
agatcgcgca gcgcctggcc gccggtgcgg tgatccagac cggccagcgg cagacaatta 2760 
aacgacaact gccggaccag ccgccaggcc atttcgcgtt cagccagcgg cggctgcggc 2820 
gggcgcggcg gtctgataag acccacgccc gccaccggaa tcgccgcatc tacggtcaga 2880 
tcatcccggc cattacgtgg aataaggcag ggcagatcgc ggttagtcac cattgccgtg 2940 
acggtgatat ggcgcagatt ttccgggtag ggcgcttcat gctgatcaac cagcgagagg 3000 
aagacttccg agccggtata gggggttcgg gcgccacagc ggcgggcgtt: ttctgacgag 3060 
cggcgcggtc cacgacgcag tgaaaaacaa cgcccgcggt tgcctccgtc acratcacgg 3120 
gtgtgataca gcggacgaaa aatcatcttc cgtgtggttt ccgcctccag gcccccgact 3180 
tcctgaacag aaaacacctc gtaatccagc ggacgggtac gatccaccac cagatgctgt 3240 
tccgtcacgc tgtgagtgac ttcaatccgg gtggtggtgc gaggaagcag gttgaccacc 3300 
ggcgtacaga acaggctgaa ctgtgcagcg tccgtctgat gaatcagcca gtccggcggc 3360 
aggcggttaa gcagtatgac aatttccgcc acattgccct gcaccttttg taacccggca 3420 
gacaatccgg tcggggtgaa gaagtaaaac cgttccggac aggcgaaaaa ttcatgcagc 3480 
agattatggc cgtgaaacac gttccaggcg agcggcagca gcccctgccc tggctccagc 3540 
ccntcgtgcg ccaccgggtg ttgaagattc acattcagt: cgccgccaaa gcgaccgggt 3600 
tcaccggcca gtgtggcgac ggcgcnggta tgdagcagct caaacaggtg tgacgcaatg 3660 
cgttcttcgc cgcagaggca aaagggcagc cgtgccggac cggccagctc gctgaaagcc 3720 
agttccccga aggttcgcag ggtgatgcgc aatgccccgg cgacatgaac atcaggcggc 3780
agatagcggt gcagggcggg catatccggc ggcgcggcgg tcaggcgcac ctcctcgatg 3840 
gacagcggcc acagcgtgac gtcctggctg ctgcgaaact ggcaggcggt attttcgcct 3900 
tccgggatgg gggaaacgaa cgcggtatcg cgcggcacgg tgaccccttt cgccaggccg 3960
cctccctgcg tatcgggata cagctttacc actgccatcg anggcgtggg ggcgacgtaa 4020 
ttggggctga cgacttccag taaccgctgt gngaagcggg gaaactcggc gccaactctt 4080 
agctgagtgc gggcgctcag aaagctgaac gcctcgatca tgcgttccac atacgggtcg 4140 
gcaatatcgg ttccctgcat ccccagtcgg gcggcaattt tgggatggag ggtggcgaac 4200 
tcagcaccgg tctcccgcag gtagctcagc tcgcggccgc aacactccag tagccgtgga 4260 
tccatgaata acgccctgca ttaaaagaac gcaacgcggc tcagctccac atccagcgcg 4320
ctgcgcacca gaaactccgt gggatacggc tgcgtcagaa tttgtccgcg aatttcaaac 4380
tgtagcgtgt tatagctacc ctgccgggct ttatcgagca agggcgcgac cctgagtgtg 4440
gcggcgttca gccggggttc gaaacggata atggcgcgcc ggatcgcccc gccgataccg 4500
tcccacttac gctcatacat aaagctgccc gccagcggcg gcaggccata gctgagcact 4560
gacgccgccg cctgcggata gcgccgggcg tcgatgtcac cctcgtggct aatggtattg 4620
agcaaaaagg agagatcccg gcgaatgatc tccttcagtt gtaccggcgt gacgcngata 4680
tcccggtcaa ttttctgata cggagcattg tcacacagcc gatcaaacag cgtgggcagc 4740
aggtggttag cgggtgtaaa acgagacgtig ctcatgcgcc accgttttcc tgagcatgaa 4800
aggtacaatg ggccatgtcc agcaggctga tatcgccgtg gccggtcagc cacaccctci: 4860
gccccagcgc ccgcacggcg grctcgccgg ggccgtcctg ccaggcggcti tccctgcaca 4920
gacgcagggc gtcggatgca ctttccgaac cgctgtaacg ggtaaagagc caggcgcccc 4980
gcgcatcgcc attcaccagg gtgatattaa cgggtttcca cagcagatca gccaggcgcg 5040
tcggttgcgg cgattccagc gagcgtattt gcgaaaacgg cagccagata cacacgccgc 5100
cggtgactag ctcaagtacc gggccaaggc gggaatcgct gtcgctcgcc cagtcaaatg 5160
cgccgccgct ccactgcccg cccgtgtctg ttatggcntc cagtgcggca tcacggtgtt 5220
tatcaacctc accggtatcg tcatgacagg cgagtgccgc cagcagtgac tccacccaaa 5280
cgggctgcgg cagaagaaaa ccgggtcgtt gttcaccctg aaaaacggtg tggcggaaca 5340
tttcgcagcg aaccagctcc cggtacagcc gggcctcctg ggtataatxg gcccccatcc 5400
cggcgcatag ctgaagctgg tgtagcgccc gcgaccagtc cccggccaca cacagcaact 5460
gaaacaggct gtggcggcag agcgcttccg ccggatcctc ccgaacctgc cgctccgcca 5520
tctgaatccc ctccgcaata gagcattcct gtatcagcgc ggacagggta gcaggaagcg 5580
tgtcagtttt tttcatgggc ggtatttcca tttttctgtg tcggagtgat tcggtagtgg 5640
ctatcgatgc caataatacg atgctcgcgg cgggtcagat cgggcagtat ctcgctatgt 5700
gcggtrtttcc cccccagttc cggcgagagc agttgcagga tatcgggcat tgatcccatc 5760
gccagccagn gcatttcacc ctcaccggtc gtgtccagcg tatccagaat ggcatcaata 5820
ccgggcgcgc ctgccaccat atcccgtaac gtgtcggtgt cgccctttct atcgcataac 5880
gacgtcaaat cctgggtgac agcgtcttca tttcggggaa acggtttagc acggaggggc 5940
ttttcccgcc cggagggccg caatgcctgc tgatattcct ggtaaagccg gtgcagtgtg 6000
ctatccgctt catggccagt gtaagaggat gcggtttccc ggacgggaat aatatcgaac 6060
gggttctgcg gatcgagctg ttgttgttca aaccagccca gatccaggta cccagcaacg 6120
gattcagatg actgaaccaa ctgaggaaac gaaacatcgg gacgggactc atcattggtg 6180 
cgggcaaggc accaggatga cagaccccat tcaatcgtat ctccatcatt cagccgcata 6240 
cggtgatgag gttccatcac ctgctcattg acggcacagc acagttcatc agagtgacta 6300 
acaatccacc acgcctcttc gtggcgaaca agactgagcg cgatatcgtc tttcccatgc 6360 
tgctcaaaaa gcggaatgta gggggcggcg gcagtaaata cgatgtatgt ccctgcggga 6420 
tactgtgcgt tatccgttat accctgcgac tgtattttcc gtaaccccca catgtcttgt 6480 
ttcattatgt cccttaaatg tatctatttt tctggaggaa acgtzttaggg agttttaatt 6540 
cataaataat tttaactaaa tttataggga gtcattattg aCgacaccct ttttattatt 6600 
tccgtcgcga atgcattgtt gtatgcatag atgtcttttt tgaaatatta tttcttttaa 6660 
tnct gcaact gcgactttta tcttgctgtt cattgattat ttaatatcaa cggctitagtc 6720 
gctaantagn ccccgatctg tatcaccgtc ctgcttcgat acrtcrtcga ggctatcaan 6780
aaagaattgt tcatatattt atatgcattg atgcattatt cttatgaatt tttatgtcac 6840 
aa gg C ataac acacggaaac tcctgrttca cgcagtgcgt tgtatggaaa actggccggc 6900 
ccactattcc ggtcgctgga atcggcaacg gcattctgca aaccacgccc taatccctgg 6960 
gttgagctga ctcactggct gcaccagtta acacagcagc ccgataacga tattctccac 7020 
gttcttcggc attaccagat ccctctttct gatgtggaga aagcgttact ccggcaactg 7080 
gatatgctgc ccgccggggc cagcgccant: agtgattttt ctcaccatat cgatctcagc 7140 
gccgaaaagg cctggatgct ggcgagcgtc cgttacggcg ataacaaaat tcgcagcggc 7200 
cggttgct - gc tggccttgtt gaccacgcca gaaccgcgtc gggtactgag cagtatctgc 7260 
gcgccgcngg ccacgcttcc ggttgacgaa ctgacggaaa tactgccctc gttgatcgaa 7320 
acatcgccgg aagcgcagga gcgcccttac gacggctccg gcctggcatc agccattccc 7380
ggcgaaagca gtcaggcgat tcccaacggc gggcaggacg gtaaatccgc gctggcaaaa 7440 
tactgticagg acatgacggc acaggcgcgc gacggcaaaa tcgacccggt gacggggcgt 7500 
gagcatgaaa tccgcaccac gacggatatt: ctgctgcgcc gtcgccagaa taatccacta 7560 
ccgactggtg aggcgggcgt cgggaaaacg gcggtcgtcg aaggttitcgc cctcgcgatt 7620 
gcgcaggggg aagtgccgcc cgcgctgcgg gaagtacggc tactggcgct ggacgctggc 7680 
gccctgtcgg ccggagccag catgaaaggc gagtttgaat cgcgtctgaa agggttactg 7740 
gaagaggccg ggcgcccgcc gcagccggct attcngtctg ttgatgaagt tcacaccctg 7800 
gtgggcgcgg gcggcgcatc cggcacgggc gatgccgcca acctgctgaa accggcgccg 7860 
gcgcgcggca ccctgcggac tatcggcgcc accacccgga gcgaatacaa gcgccatatt 7920 
gagaaagacc cggcgccgac ccgtcgtttt caggcgttgc agattgccga accggaagag 7980
atccccgcaa tggaaacgge gcgrggtccg gtggacacgc tggaaaaaca ccacaacgta 8040
ccgaccctgg atgaggcggc acgtgcggcg gtacagcttt ctcaccgcta cacccccgcc 8100 
cggcagttgc cggacaaggc caccagcctg ccggataccg ccgcggcccg cgcggcgccg 8160 
acgccgcaca cgccgcctgc cagcgcacag ttcctgcgcc agcagctaaa agcggcggaa 8220 
atggaacggt cgctgctgca gaagcaggaa aaaatgggga ttcagtcaga tgagcggcgc 8280
gacgcgctga tggcgcgaat tttcccgctc aacaatgaac tgactgcatc cgaatcccgc 8340
tggcagcggg agccggaact ggtacatacg ttgcaggaac tgcgtctcgc agagcctgat 8400 
gccgangaca aaaccacgct gcaacaggcc gaaacggcgc taagggagcg gcagggcgac 8460 
gcgccggcgg tgctcccgga agtcagcgcg gcggttgtcg cggcgattgc cgccgaccgg 8520 
accggcattc ccgccgggcg catggcgaaa gatgaggcca gccaggcgc; ggaaccgccc 8580 
gcccgaccgg cgcaacgcgt caccgggcaa gacggcgcgc tggcgcagat nggtgaacgc 8640 
atccagaccg ccagggcggg accgggcgat ccacgcaaac cggtgggcgc gcctatgccg 8700 
gccgggccgc ccggtgccgg taaaaccgaa accgcgctgg cgccggcgga ggctatctac 8760 
ggcggcgagc agaacctggt aaccaccaat atgagcgagt tccaggaggc ccacaccgtt 8820 
tccacgccga aaggcgcgcc gcccggctat gcgggctatg gcgagggtgg tgegctgacg 8880 
gaagcggcgc gtcgccaccc ctggagcgta gcgctgctcg acgagaccga aaaagcgcac 8940 
catgacgccc acgaactctt ccatcaggtg tctgacaagg gtgggatgga ggacggcgag 9000 
ggcacacacg ccgatzttcaa aaacaccacg ctactactca ccaccaacac gggttccgac 9060 
cccatcagcc agatgtgcga agacccggcc ccaacgcccg acgccacggg gcccaaagag 9120 
gcgctaacgc cggaattgcg caagcatttc ccggcggcat tcctgggccg cgcgacggcg 9180 
atccctcacc tgccgctgga cgaaacgtcg cgcggcgtga ttgcccgtct gcacctcgac 9240 
cggccggtgg cgcggatgag tgaacagcac ggcgcgacgc tgacgtatag cgaggaactg 9300 
gtcgcacaca ccgtggcgtg ctgtccaatg catgaaacgg gcgcgcggct gccgattggc 9360
cacatcgaac agcacactcn gccacgactg tcgcgctact ggttgcaggc cacgacggaa 9420 
aaagccgcga ccaggcagat tgatatcggc gttaatggtg atgagcagat tgttteegag 9480
accgtttgct gaaaccggcc grncgaagtg tccgcagtgc gattttaaaa actgtaccgg 9540 
cataccgctc cccttgcggc aaccagttga ctaaaaagga aatgaaggat catggccatc 9600 
aacaacagcg cgcagaaatc caccgcgcgc aaccgcgcgc cgcgcgcgca gaccgaatat 9660 
gacgcagaga tctacggctc cgagaaaaaa accgagccgc cgcccgcgat ggcggcgccg 9720 
gccgatctgg ccgggaaacc gcgcgaagaa ctigccgccgg cgacggatcg caaatccctc 9780
gatattgaca ccgataactt caatgagcgc atgaaagcca ttgcgccgcg cgtggcgttc 9840 
gctgtgccga atacgctgac gggtgaaggt cagttgatgg tcgataccac gctggaaaat 9900 
atggacgact tttcgccggc acagattgcc cgcaaggtgg acgccctgaa ccagctactg 9960 
gaagcccgca ctcaactggc gaacctccag acctacacgg atggcaaggc gggggcggaa 10020 
aatccggtca ataaactgtt gcaggacccg actctgctga aaacgctggc gaacgcgccg 10080 
aaatcggccg ctacccagca agatgtgtca gcggataatg aatcagcgga ataacgtcga 10140 
atttttaagg aattttcatg gcaaacagta atatgcaggc aaccgacgcg gttsgctcagg 10200 
ataccgcctc cgcatccggt gaacttgatg cgttgctgaa tcaggccttc cgacccaaga 10260 
ctacccaggc ggcaaaagcc gnggaagccg cggtgcagac gctggcgaac acgaccaccg 10320 
ccagcgatga cgcctataaa agcaccagcg ctattactigc gcagatcgac cctaaaccga 10380 
ccgaacagac caaactgatc ccgcaacacc ccgactggca gaagccggaa ccczcgtggc 10440 
gcggtatgga gcatctggtt tacaacaccg agaccgacga aaagctgaaa attcgcctca 10500 
tgaatctgtc aaaagatgaa ttgcggcgca acatgaagcg tcacaagggc accgcctggg 10560 
atcaaagccc gatgttcaag aaactgtatg aagccgaata cggccagtta ggcggcgaac 10620 
cCt _ atggc1 - g tatcatt g Cg gattactact tcgaccatac accgcccgat gtggatcngc 10680 
ttggctctat cgccaaagtc gccgcgtcgg cccatgcgcc gtttattgcc ggggcttccc 10740 
cctcggtact gcaaatggac tcctggcagg aactggcgaa tccccgcgac ctgaccaaaa 10800 
tcgtcaccca gaacctrggaa tatgcgccgt ggaacccgct gcgggctagc gaagactccc 10860 
gttatactgg cctgacgacg ccgcgttttc ttgcccgccc gccgtatggc gcaaaaacca 10920 
acccggcgga cgagtttgat tttgaagaag atgcggatgg ttctgaccat accaaatacg 10980 
tctggagcaa cgcggcctac gcgatgggcg taaacatcaa ccgttccttc aaacactacg 11040 
gctggtgtac gttgattcgc ggtgtggaat caggcggtgc ggtggaaaac cttccctgcc 11100 
at _ acct - tccc gactgacgat ggcggcgtgg acatgaaatg cccgaccgaa accgccatct 11160 
ctgaccgccg cgaggctgaa ctggcgaaaa acggttttat cccgttgatc caccgtaaaa 11220 
actcagacta tgccgccttt atcggcgcac agtcgctgca aaaaccacag gaatactacg 11280 
atccggacgc gacggccaac gctaacctgt ctgcccgtct accgtacctg ttcgcctgct 11340 
cgcgcttcgc tcacttcctc aaatgtatcg tccgcgacaa aatcggtccc tttaaagagc 11400 
gtgaggatat gcagcgctgg ctaaatgaat ggattacgaa ttatgtcgac gccgatccgg 11460 
tgaactcctc gcaagaaact aaagcccgcc gtccgctggc tgccgctgaa gtagcggtgg 11520 
aagaggccga aggcaatcca ggttattacg acgcgaaatt ccccctgcgt ccgcacttcc 11580
agctcgaagg gctgacggga tcgctgcgcc tggtgacaaa actgccgtca gtgaagcagg 11640
gcaatgcctg atatatattt tgtgaatgtt taagcgagtg aagtcagaga agatagagaa 11700 
tataaagagg gatatgaaga aaagaatttc gtctcgccca cggtctcgta aaggtggggt 11760 
acgtaatgat gacacatacc cgaatgccag caacaatgcc gaagcttttt atatcattga 11820 
gtaggaaaca catattatga ccataagccc aactcttcat ctgttacctg gtattgttct 11880 
gctctcttca caatatgctg tagcctggga agtcagttgc ccggctgtta ttgataccca 11940 
gtcttctgct gtgagcctga agtctgatgt cccagcggcg tggcagcttt ctccccgata 12000 
tatgtcgcgt ttatggttaa gtagtattgg ggtaacgcag ggtaaacctg aaaacctgat 12060 
ggacctcaaa ccagagacta aaaaagtaaa cggtgaaaat tggtctgtac gggaaacaga 12120 
acgtggtagc gataaagaaa ccgatcgcta nngggtttcg tgtatttatg gtcatgaaca 12180 
aatatggtug acgcaaccaa cacccgcttc ttctacccgc cgtaagaccc gtaatrttga 12240 
gggancgcca gaagaccagt ctgtaccttr tatctgtaat tagcgacctg agacgtgaaa 12300 
atttcagtac aggttatggt ttttattatc ggaagctatg aagcattatt tatatgcatc 12360 
aaataacgca aattcataaa ataactaaac acattatcgg taccggaaaa atatacagtc 12420 
ctctgttctc ctgaagttat tggagaagga ttctgcacgg caatgatcta tctataaaca 12480 
aaaagatata gataaaatca ggtttatttt aagtaaaact taataaggat ataaaaatgg 12540 
cttatgacat ttttttgaaa attgacggca tcgacggcga gtcaatggat gacaaacaca 12600 
aaaatgaaat tgaagtactg agctggcgct ggaatattca tcaggaatcc accatgcacg 12660 
ccggcagcgg ccccggctcc ggtaaggtct ccgccaccaa cctggatttt: gaccactata 12720 
tcgaccgcgc cagcccgaac ctgttcaaac accgcgcctc cggcaagcac actrccgcagg 12780 
ccattctggt tatgcgtaag gctggcggca atccgctgga gtacctcaag tatacctcca 12840 
ccgacctgat tgtcgccgtg gtttccccga gcggcagcca cgatggtgaa atcgcctccc 12900 
gtgaaacggt ggagctctcc ttcagcaccg tgaagcagga atacgtggcg cagaaccagc 12960 
agggcggcag cggcggcacc atcaccgcag gctacgactt caaggccaac aaagaaattt 13020 
aacgg ctgtt tttccggcca gatgttatgt ctggctggtt ttattgtttt gattttaaag 13080 
gaatttacag tgaataaatg gcgtaacccc accgggtggt tatgtgcggt agctatgcc: 13140 
tttgcactgc tcctgctctc cggatgcggc agtagcgatt cgctacttga cccctaatcg 13200 
cagcggcctg gcctgagcgt gaaagcgttt tacaaggtga attctgacaa tcagaagaaa 13260 
gcggcgtcca tgaagatacg cgttgagaat taacgaccta cacagaacct ctagaggtza 13320 
agcaaaacga acagaccttc atccaatgaa gcgcggttag ctttcaggaa ggtgaaccat 13380 
tccgtcgctg atgtgggtag cattattggt ggaaacgctg ggaaaaatat aactggtggt 13440
cactttcaaa atgcccgccc tattcgaatg agctatgtct tgaatgcgac agggttccca 13500
atagcccgca actctccgta tgcaaaggtt agtggtgccg ataataaatt ctatactcat 13560
cgcgtgaatg atatgactga ttatcttact cacactatgg gcaagcctga tctcattgtt 13620 
aacaatccga aacagagcga cctcatcggg aagaaaggaa ttatcgtagt aaaagggcat 13680 
ggctggagca atgccagagg acacgttaca ttatggaatg gcagtatctg ttcagatcag 13740 
cgccactcat taaatgaccc agataatgga ccatttgttc ctgaagttgg gacactgtgg 13800
atactgccgt gaaatggt:ca atattagtta ctttcagcac aagtggaatg ccggcatggc 13860
agccatcctt cgcacaagaa gcattgacca cacaatattc acagtcggaa ctcttaaaaa 13920 
attgggcgcc gagtcattgt ctggcattag catacaaaga tgatgtcgtt aaaaacgatg 13 980 
ccagagccac ggccagtgct taccttgaac atggcaaaca atctgtggag arntaccatg 14040 
aaattgacga gattgcgaaa aaatattcag ggttgaaata taacggtccg acatcatcag 14100 
actttaatac catgaagtgc atagatttta tccatgacag ggaattaaat gaattaatta 14160 
aaaggcgtrgt cgagaagtaa aattcaaaga tattaaaagc atacgttctc ttgctctgat 14220 
gaatttatgg gtaagaaaga gctigtacagg aatagttaat ctgttcacct aataaagcag 14280 
ataaatcagg gcttaattta ggtagttaaa aggatagtag atatgtctta tgacattttt 14340 
ctgaaaactg acggcattga cggcgagtca atggatgaca aacacaaaaa tgaaattgaa 14400 
gtactgagct ggcgctggaa tatccatcag gaatccacca tgcacgccgg tagcggtctc 14460 
ggntccggta aggtcticcgt: cactaatct: tcatttgaac attacatcga tcgcgccagc 14520 
ccgaacctgc tcaaatactg ctcttccggc aagcacattc cgcaggccac rctggttatg 14580 
cgtaaggctg gcggcaaccc gctggagtac ctcaagtaca ctttcacaga cczgattatc 14640 
gcaatggtat cgcccagcgg aagccaggga ggggaaattg cgtctcgcga accaattgaa 14700 
ctctccttca gcaccgtgaa gcaggaatac gtggtgcaga accagcaggg tggcagcggc 14760 
ggcaccatca ccgcaggcta cgacttcaag gccaacaaag aaatttaacg gccgtttttc 14820 
cggccagatt tatatctggc cggatttatn attttgattt taaaggaatt tacagtgaat 14880 
gaatggcgta accccactcg gtggttatgt gcggtagcta tgcc-ttttgc actgctcctg 14940 
ctttccggat gcggcagtag cgatgcgtta cctgacctcg aatcacagcg actcgacctg 15000 
agcgtgaaag cctccgataa ggtgaatcct gacaatcaga agaaggccgc gcccattgag 15060 
acacgtgttt acgaactgaa aaatgacgcc gctttcacga cagctgatca crggccgctc 15120 
catgacaacg acaaatccgt ccttaccgac gatttagtgc gtcgcgacag cttcatcttg 15180 
cgtcccggcg aagagaaaaa actgcgtcgc ccgctgaacg cgcagaccac ggcaatcggc 15240 
gtactggccg gataccgtaa cctggccaaa tcggtctggc gggtaaccta caaaatcccg 15300 
gaagccccgg aaaaagcccg gtacagcagc ttcatatcgg ggaaaggaaa agtgcagttg 15360 
gaggcggaac tggaacaaag cgccattgta attacggaac gggataaatg aattatgagc 15420 
tggaatgacc gcgtagtctg gagtgaagga caatttttac tgccgcagat gtttcagcag 15480 
caagagcgtt atctggaaca cgtcatgcat taccgcagcc tgccgctgac cccctttttc 15540 
tggggattca gccactacaa tattgatggc gaagcgctga acatcggtaa actgatactg 15600 
aaagaggcat cagggatttt tcctgacggc acgccgtcta acgcaccgga ccacaccccg 15660 
ctgccgccgc cactgaccat tctgccggag cacctgaacc agcagatttg tctggcggta 15720 
ccggtacgcg cgccgaacag cgaagaaacc acgtttgaca ataacccgga atcattggcg 15780 
cgtttctcgg cacacgaaca cgacatccgc gacgccaact cgctgggacg tggcgcgcag 15840 
ttattacagc tcagccatcc gcgcctgcgc ccgctgccgg aaaaggcggt gacgggcgcc 15900 
tggattggcc tgccgctgac ccgcatcacc gggttgaacc ctgacgggcg gacagatatc 15960 
gaccacgacc tgatcccgcc catcattaat tatcaggcca gtccactgat gcgtacccgg 16020 
ctgccgtgga tcaacgatct catccggatg cgggccgatt cgccggcgga acggccgacc 16080 
ggcagcgaca accacggcca tgaagcagcg gaggtctccg attacctgct gccgcaaatt 16140 
ctcaatcgct ttgagccgct gctgactcac ctggcgaaaa ccccgctggc cccggaggtg 16200 
ctgtaccgct acctgtccga actggccggg gaactctcca cctatgtgcg tccacaaacg 16260 
cgacggcccg ctgaatacaa agagtacaaa cacctgacgc cctatgccgg gttgaaatcg 16320 
ctggttgatg aggtgcagcc cctgctgaac gcggtaccga tccggggcgc gcagcgcatc 16380 
gagctgaaag aggggactta cggcatcctg aatgcggcgg cggccccttc cgatcccgcc 16440 
gatttcagca cgctggtact ggcgataaag gcttcaatgc cgaccgacgt gccactgcaa 16500 
cattttgccg cccagaccaa aatcgggcca cccgatcgcc tgccggaact gacccgctcg 16560 
catctgccgg ggctggcttt gcaggttccg cctgtaccac cgcgccaaat cccgtttcag 16620 
gccggataca tctattacga catccgccgc gagggagcat tgtgggaaca cattgcccgc 16680 
tacggcggga tggccacgca taccgccggg gaatttccgg ggctggagac agaactgtgg 16740 
ggagtgcgcg ataaatgaca gacagtaccc tgacgccgcc agcggcggat atgatgtcct 16800 
tcctgtccac cacgccggaa cataaggaca gtgaacatga aacgccggta cacaccagcc 16860 
agcgcacgga actcaatgtc atcgttgaag acggtccgga cagcaaactc cggctggctg 16920 
aaatcagcgc ggcggctaac ccgtcgctcg ccgctgcccg gcccttattg tgcgctctcg 16980 
cagccatgcc cgctaaactg gatgcggccc tggtagagcc ttaccgtaat ctgctggtac 17040 
gcgagatgca tccgtaccag acatcacgcg atcaggcgaa cctgcggcgc gagcacgtac 17100 
eggcggtacg ctactgcctg tgtacggcgc ttgatgaagc cgccaataac acaacctggg 17160 
gacggcgcgg cgtctgggcc ggaaaaagcc tgctggtaac atttcatggt gaaagcgaag 17220 
gcgggataaa acttttccag atcatcgggc gtctggcggc cagcttccag gagcatggca 17280 
acgtactgga ggttatctac cacccgctgg ggttgggatt tgaaggccgc tacagcgtgc 17340 
agccagacgg gcgtaagcaa ctggacaata ttcgccagca actgctgaca cagctttcac 17400 
agcgtcgcga tccggttatg cccgcgctct cgcctgactt tcagggggcg ataagcggac 17460 
gactgcggcg gatgcgccgg gtgccggtct ggctgagcgc cgggatagcc ctgttggcga 17520 
tgctgacgct gtttggccct: tacagccacc ggatggatgt gcagaccgtc accgtacaac 17580 
agcatattga tgcgattggt ataaaactgc cgccgccgcc tgtgccggtt cataagctgc 17640 
ggccgaaaac cctgctggca aacgaaatcg cccgtggccc gctgaccgtg gacgaagatg 17700 
accagcacag tagggtggtc trccgtggcg acgccatgct tgtgccggga cagaaaacgg 17760 
tgagtgacgc aatccggcca gtgattaaca aagcggcgcg ggaaatcgcc cgcgtgggcg 17820 
gcgcagtcac tgtaacgggt cacactgaca gccagcccat tcattcggct gaattcccat 17880 
ccaacctggt actgtcggaa aaacgggcgg cggaagttgc ggcgttgctg acctccggcg 17940 
gcgtacctgc cggacgggta cataccgtcg gcaagggcga tacggtgccg gtggcggata 18000 
acggcagtaa agccgggcgg gcgaaaaacc gtcgggtgga aattctggta gtggagtgaa 18060 
tgaatgatga aaaaatcaac ctatgatgtg tctcatcatt cggcagtatg tggcgtgacg 18120 
agggattact accggatctc agcgacatat cacataacac gatctgttcg tgtt^ttttg 18180 
atcatcttat gtcgcccgcc acccggcggc gcttttgccg gacccccgat taacgcagga 18240 
ctcatttccc ccgataatgt caacctcagt actcaggatt tcctgaaatt cutgccact 18300 
gacaacgtac agaaaaaaga caatgcactg atgtatatgc tgggggtgga ggatgcgaca 18360 
gaaggtaaag cctggtgtgg atatggtcag gttgacagta taacaataaa ccatactgtg 18420 
ctgacccggt ttgaacagca cgcagtgaaa aagccngatg taagggcttc aatactaata 18480 
gaggaagcat tagttaaaaa ctttccctgt cagaggacag actcctccat aaaaattgcc 18540 
tcccggtcat ctcccatttt atccctgacg ccggatgcgc ttaatctctc aggtaatgac 18600 
ttttttaaat tttgggtgtc tggtaatcaa cgggataaac tcagggcggg tgcctatctg 18660 
ctcggcgtgg aggatgcgac agagaacaaa ctgtggtgtg gatacgctrtt atctaagacg 18720 
ctaacattaa atgaattagt ctatgtttct cttaaaaata aaaccaatga ggaactgaat 18780 
tctcgcgcgg ctgaacttat cataaataaa ttaatagagt atccctgtaa tacataaaat 18840 
cacccaagtt gcatcaaggc ggcaagggag tgaatccccg ggagcgtaca ttagctcgcg 18900 
actggggtga gcgaggaaag ccaacgcaca tgcagcctga agtacgacag gtataccctg 18960 
tgataatgtt atcgctgcca gttcagataa agtctcggtg gtaaagtctt ggtggtaaga 19020 
tattgatatg tttaaggtaa tggtctgtct gttggtgggt gttcctgcca tctcttacgc 19080 
gcacgattat gggtgtgcta cggtcggagc atcaatggag ccatcattat ttgatgccat 19140 
aaaaaatgac ctgaatatag atgtcgctac tattataaag gataaaacaa aggtagagat 19200 
tcttgatatc tcacctgtat ctaaggtcta tgcagaatct ctggccagga tggattatga 19260 
aaaagataag gccaaaaata aagcggcaat attagataaa aaatcctatt tcgatagtta 19320 
ctatgaaaat caggttaaat ctaccgtggc aaaatatacc tatattaaca aagataaaga 19380 
aaaagacatt ttcattgcat ccagcttcac gaatgctgat gagtgttccg caagattcaa 19440 
tggctatatt actttatcca gggaacttta aaacattgac cattgggtgg cgagtcgaaa 19500 
tataacgcaa aaatttctta gtccgctttt ttcccggcgc gcgcrggcag Ctgtgggccc 19560 
cccggttctg gcgctgctgg tctggcctigt cgggccgccg gtgtcatttg ataccctgcg 19620 
cccgctggcc tccgtgggta gccgggtagt gaccattgcc ctgtcgctga tgctgctggt 19680 
actgtggctg gtcaactggt cgatgagtat catcggcatc agtgtcctgt gcctggcgat 19740 
tggcctcgtc acaccgctgc tggccccggg cgatgtccat ccgtttgcgc cgctgtgggc 19800 
ccgcctgacc ctgattggtt tcatcctgct gatgtacgcg ctgtacggcc tgtaccggc: 19860 
gtggcgtgcg ctgcgtatgg atgaacaact gctgcgtcgc ttcctgcatc cgcgcgggga 19920 
agaggtaccg gtggcaggcg agatcaaagc cgacctgcgc accgtcaacc atattgtcac 19980 
gcaggccatc cggcagctgc ggcagttgcg ggnggatatg cctggctggc gcaaaatctr 20040 
cgagggaaaa cgctttctgc atgagctgcc gtggttcang gtggticggca gtcccggcga 20100 
cggcaaaacc acggccctgc tgaacaccgg attgcagttc ccgctggcgg agcaaatgga 20160 
gcagacttcg cgcatcctga cagtaccggg tggcggcacg ctacactgcg acnggtggtt: 20220 
taccaacgaa gcggtgttga ttgataccgc cggacgctac gcccgccacg atgacggtgg 20280 
tgaagcgagc gccgcgcagc gtaacgccgg agagtggcag ggctttctcg gtzctgctgcg 20340 
taaacatcgc cccggcgcgc cgcttaacgg cgtgatcctg acgctaaacg tggcggacur 20400 
aaccgcacag tcaccggcgg aacgcctggc ggcctgcgcc gccccgcggg cgcgactggc 20460 
agaactgcgc gagaccccgg ggattcgctt tccggtctat ctggnggcca ccaaaatgga 20520 
tttgtcgccg gggttcagcg aatattttcg cacgctgacc agccatcttc gtgcacaaat 20580 
ctggggcctc acgttgccgc acagccgcag gcgaaaagcg ggcgacccgc aggcgctgca 20640 
cgccgcctgc gcgcaggagc tggcgcgcct gacgctgcgg ttggatcagg gactggatac 20700 
ccggtcacag gaagagcacg accttaaaag ccgccagcgg ctgtatacct tcccgcgtga 20760 

gttcgccgcc ctcggcgagc cgttgctgga ggctattgaa cagatcttcc tcgattcaaa 20820 

attcgatgcc acgcaactga ataacacgct gcgcggggtg ttcttcacca gcgccgcgca 20880 

ggcgcaggcc gacgccgtgg ccgaccagtt gagtatctgg cagcgctttg cccgggcgat 20940 

aaaaaccgcc cgtggcgaac cctccgcccc tctcccacac gctctgccgg acggcaaccg 21000 

cagctacctc ctgcatgacc tgctgacaca gtttatttitit cgtgaagcgc acctggtgga 21060 

gccaaacctc cagtgggcct ggcgttaccg cctgctgcgc ctcggcgggc acctgctggt 21120 

actggtgctg gcattcctgt tgtggcaggg gatgcagacc agccagcaga ccaacggcga 21180 

ctatctgaat gaaatcagcg cccgcgcgac ccggctggac ggtgatgtga aagcctacac 21240 

cggtaaaccg gcgatggctc ccgtcccggc actgctggac agcgcaaggg aactgcccgc 21300 

ctggccggaa ctggacccgg acgcgccgcc gccggccngg cgctacggtc tgcacagcgt 21360 

accgccggca accgacagcg tggcgtcgct gtacaaccgr ctgccggacc aactgccgct: 21420 

gccgccgctg gtgaaacgga tggagtatgt gctggcggac gccattgccc gticaggatag 21480 

taaagcggcc tacgatgccc tgcgcatcta tccgctactig aatctggata aagaccacga 21540 

agataaacac aacgcggcgg agatccagtc ctgggtgatc aacgatctgg ggaacagcga 21600 

cagcgtggcc gggtncggcg ggcgcgccgc cgtgctgacg catatcgaag cgctgtttga 21660 

cggcagccgg gtggtgcatt caccgtatga gaaagatgag gcgctgatcc gccaggcgcg 21720 

ggcattcctc gacggtcaca ccagtaccga gcgtatctac gcgcgggcgc tggcggcaat 21780 

ggagagcgaa gcgccgcagg agttcacgct ggtacgcgcc gtcggcgcgg atgcgggaac 21840 

ggcccttgcg cgtagcaacg gcgcgccgc: ggaccggggc gtgccgggna tttccacccg 21900 

tgaaggatac cgggagctgn tcgacaaacg actaccggaa cctgcggcgg cggcgacggc 21960 

gaacgatggc tgggtgatgg gccgggagag tacgccaaaa aagctgactg acagcctgcg 22020 

cagccagaca ccggggcagg agcagcctgt cgcccgcgaa gtccgccgtt tgtacctgac 22080 

ggaatatgcc cgccgctggc aggattttct ggacagtatc catagtatca acagtgccgg 22140 

ggaagagggc agttccggcc cggcctatga tttacaggtg ctgcgcaccc tggcgtcgcc 22200 

ggactcaccg ctgatgcggc tgggaaaagc ggtggtggag cagaccacgc tggtgccgcc 22260 

gccggacccg caggccagac agaaacaact ggcgcagcgc gcatccggca acgcggggaa 22320 

agtggtacag acggcaaaac tgttccagga tattcacccg gaagaacggc tggaaaaaac 22380 

gctggtggat gatzcgtttcg ccgcgctgcg cgaggtcatt gccgggcgga cggacggcgg 22440 

acagagcggc ggtgggacga tgcagatcgc ttcgctgctg accatgccca acgagtatca 22500 

cacccaactg accaccgctg atagcgcgct ggcggcgggg acgttgccag cgcgcaccac 22560 
tgccgcagac aagttgcaac cggaggcggc gaaactgccc gcgccgctga aaaacatcct 22620 
gctggatctg acgaagcagg gaacgcgcaa aatcaacgcc gggaccggcg acgtgctgaa 22680 
cacccagatg gaggcgatga tgggcgacga ctgccgtgac gccatcgacg ggcgctatcc 22740 
gricgccgac agtccgcagg aggtcagcgc cgaggacntit aaccgcatct tcgccagcgg 22800 
cggcgcactg gatgcttcct ggagcaaaca actggctccg ctggcggaca ccgccagcga 22860 
cccgcggcgc tacaaaccga ccgaaggcaa catgacattg caggggccgg atctgacgcc 22920 
gtrxcagcag gcgaagcaga tccgcagcgt attttttaac agcgagggcg ggaaaaaatc 22980 
ttcctggtcg atgcagatta gcgtggtgga tatggacccg gccatcacgg aactggtgat 23040 
tgatatcgac ggccaggtgc tgcgctacgc ccatggtccg gaccgtcccc tgaaagtaac 23100 
ccaaccggga ccgcgcaacg gcrcgatggc ggaaatcacc gccagcccgc gtatccgcca 23160 
ccacazctca acgctactga ccggcgggcc gcgggcgcrg tttcatctcc tggacgccgg 23220 
aazggtacag gaaacggcgg tgcgggggcg tcaactggtg gaacatgact tcgacggrcg 23280 
ccgggtggtg ctggaaatca ccgccgggcg ggattttaac ccggtcagcc gggagcrgtr 23340 
gcagaacttc agttgcccgg cgagggcgct gtaatgcgcc ggccagcagt ggtatcagcc 23400 
ccggcgatgt tcggcaggct gccggaccag cgcgactatg tgcgctggcg agtgggcgct: 23460 
gaagagggac gaatatggca ggactggctg aaccgccaga cgtgggtggg cagcgggcgg 23520 
cacatcgtgt tgccgggagg ccagacgacc gatgaagagc gtgacggctg gatgcacctg 23580 
tc=rccgcgtg gcgaacatga tatgcccgcc ccggaacacc atccgctgcc gcggagc--r 23640 
accacgccgc cgggatrctt gccgatgggt gggacgaccg actgcctgac cggagcgctig 23700 
accacgtcac gggatagcgt gggacgcccg tggccgctgg tgatttacca gcgccgcggti 23760 
cgcgagtggc tggacgagag tntgcaggag acgcagggct ggctgtactg gctggcgagg 23820 
ctggcggcgc agcatatcac gccggacacg atgcggcgcg ggcggctgac cgagcaggcg 23880 
gaccagctat gggcgatgtg gcagccgggg ccgtggtggg cgcagtggct gcgtgggtra 23940 
cgacgtacat cgcaacgcag ccgggaactg accgggctgc cggatgaagc accggtagta 24000 
gaacrgccgg gcgtgcgtta tctgccgcgg ccgggctggc cgggtaagac gctggggcag 24060 
gcaacaccgg ggcaaggctg gttctggcag cagaacagtg aagggcggta tgtggatgcg 24120 
ctgcggttgg tggacgaaaa aaatagaggt cagatgctgt gaaaaagtac gctgtagagg 24180 
ttctgtttat gtctgcctgt gcaggaatgt ccctgcccgt atttgcctgg ggcggaacag 24240 
atgtaaacat tgataacccg ctggcggagc gtgtggatac ccatcccgtt catcgtcagg 24300 
aaatgganaa ccttaccatt ccgaaaacaa cggtcacact gaaaaaaagc accggtgaat 24360 
gcggccgtLt cccaacatta accaactata ccagctcact ggcgcaggac gttgagggtt 24420 
acggacgcgg aagtgcatat tcccttcagg aggggaatat ctcactggcg aagatgcagg 24480 
gacgttatcc tttcagcttt gtattatcgg tggataacca gtctgtacgg gatcagaagc 24540 
tggcgctaac gatacgttgt acaccaccgc tgtaatacac agaacagtca gggagaagat 24600 
gacggcagca agactgactt ttgacgggca aaagctgaca tggcctggna tcgggacatt 24660 
taaggcgacc acggggttac cggatttaca gtrggccagat aaacagtgtg tgccggatgc 24720 
ggcgataccg gaagggaatt ataaactgtt tattcagttt cagggggagg caccgataag 24780 
aaatgctgcg gattgtgatc tgggaccatc atggggctgg agtaccactc cgcgaggcca 24840 
ggctgccgga acatgtgaga tacactgggc gaactgggga cataatcgca tccggctgga 24900 
atcagcggat gagaagaccc gaaaagcctg tgggggcaag cggggtggtt tttatatcca 24960 
tgattccacc aaaggttaca gccatggtug tattgaagtg gaaccggrgt ttinnccgtat; 25020 
tccgaaacag gagacggaaa aagaaaatgg ngaaaagaca ttcacggtta atgttaagta 25080 
tgtccccggt cagcaaacga atggtggaac aaaacaacaa taccgttaaa accggatgaa 25140 
attaccgtgt ctggacgtgc tgcggttgac aatgattaca atggtggcca gataagtaac 25200 
gagcaccaat gataaagtat atgactggcg gtgtgctggt tattaatcaa atttctataa 25260 
aatgcaacgc gaatctggta agcgtaataa aaataaaatc ataacattgt tatgttcatt 25320 
tccttattta tgtaattcag tatttatgtt atgtgctaat ttttgtgttt ttattttcat 25380 
ggctcttgcc agcaatatac cctgttctzc tggtaaatat ttaaatataa caggctggtt 25440 
gcattataaa gtgcggggca ctgtttcctg acggtgagtc tatttattct aatccggtat 25500 
taaaggagcc actaccatga gctttgtatc cacaaataat aaatccggta tgggagggct 25560 
gacgacaacc acgccgccga taaccggaga aagtggcggt gtcaccgcag attcagtcgc 25620 
cggaagcgtg gcagatgcgg cggaatccgc cgtggaacag gcngcgggat cgctatttgg 25680 
cgcattgccg gagccatcag gactggtgaa agccgcggta gcagcggcgc aggctgccgc 25740 
cgccgcaggt: atggcgcagg atgcggtatc ggccatcgtc tctgctgttg caggcgggcc 25800 
gggggcgcat aatgtgacgg tcagcggcag cgccgtaccg ccgggcgcat tactgttcgc 25860 
cagcctggac ggcggcgaaa catcaagtga actgttcagc tatgtggtac agctaaaaac 25920 
gcccgacacc ctgaatctgg gctatgtctc cccggcggcc aacctgccgc tcaaaccgat 25980 
ggtgggcaaa gatctgtgcg tcaacatcga actggatggc ggcggtaaac gacatatcag 26040 
cgggccggcc acggcggcgc gggcggtggg ccatgaaggg cgttcggtca cctatgagct 26100 
gcgtatggag ccgtgggtaa aactgctgac ccataccagc gactacaaag cattccagaa 26160 
taaaaccgtg gtggatattc tggatgaggn tctggcggaa tacccctacc cggnggaaaa 26220 
gcggctggtg gaaagctacc cggtacgcac ctggcaggtg cagtacggtg aaactgatct 26280 
tgattttctt cagcgaccga tgcaggagtg gggcacctac tggtggtttg agcacagcga 26340 
ggacagccac acgctggtgc tggcggatgc catcagcgcc cacaaagcat gtccggactc 26400 
gccgctggtc gagtggcacc aggaagggct gaagctggac aaggagttta tccacactat 26460 
cacggcaaac gagagcccgc ggactggaca gcgggtgctg gatgatttcg actttacgaa 26520 
gccacgttca ttgctggcaa acaccgtggc aaacccgcgt gaaaccggtc atgccaccta 26580 
cgagcattat gagtggccgg gagactactt cgacaagagt gaaggcgaga tgctgacgcg 26640 
cattcgtatg gaagcgcagc gcagccccgg cagtcgggtg ctggggggag ggaatatccg 26700 
cacactcatg accggttaca ccttcacgct ggaaaactat cccaccgccg aagtcaatca 26760 
ggaatatctg ctgatgcaga ccttgctgtc tgtgcaggac aacgcgcagc acagcgggca 26820 
ggaccagcac ttcacctrct ccacccgttt tgaactrgcac cccacccgcg aggtgttcca 26880 
cccgcagcgg acggtgagca aaccccacac caaagggccg cagagcgcca tcgtcaccgg 26940 
cccggcgggc caggaaatcc ggacggatca gtacgggcgg gtaaaggtac agtctggtcg 27000 
ggatcgctac ggcaaaatgg atgaaaacag cacctgctgg atacgcgcca gctacccgtg 27060 
ggcgggcaaa ggcttcggga cgatccagat cccgcgtatc ggccaggaag tgctggngga 27120 
tttcaaaaac ggcgatccgg at-ctgccgat: catcgtrgggg cgtacctaca accaggacac 27180 
catgccgccg tggggactgc cgggaatggc gtcgcagagc gggatcttca gccactcgcc 27240 
gtatggcggg ccaacgaacg gcaacatgct gcgttccgac gacaaaacgg gcgcggagga 27300 
agtgaagttc cacgcggaaa aagatctcaa caccacggcg aagaataatg aaacgcatac 27360 
ggttatggtg gatcgcacta aaaccattat taaaaatgaa accaacagca ttggtgagga 27420 
cagaaacacc acggtaacga agaatgacgg cctttccgta aaactggcgc agacgatcaa 27480 
tatcggcacc acttatcgtt tagatgttgg cgatcaattc acgcttcgct gcggcaatgc 27540 
ggcgcttgtt ttacataagg acggctccat tgagttttgt ggcaagcaac tgatgttaca 27600 
taccagcgat gtcatgcaac tgattggtaa aggcattgat atgaacccgg atggcggcac 27660 
agccgtaacc gccgacgaca ttgcccccct tctcacctct gagtgatcrg aattaaacct 27720 
ggagtcctca tggatcgacc ataccgcata caggaagggt gttttgtcct gcctgaaaca 27780 
tttacggatc gcagcgtcaa tatttttatc ctggagggca acgaacgaac atcgcccagc 27840 
ctgaatatct cccgcgatac gctaaaaccc gatgaagacc tgcccgccta tattgaccgc 27900 
cagattgcac tgatgaaaaa aaatctcggt cagcaccggg tattgccgcg agcgcctgca 27960 
caggcaggaa cgggcaacga tgcccttatg ggggaacaaa ttgccgccac ccataaatcc 28020 
gggaaaacgg aagcgtacca gcgccaggcc gggtttattg caacccctgg caaggtactg 28080 
gtcttcaccc tgaccagtcc ccgccctttt gatgataaag cagacctact ctggaacacc 28140 
tggccggcag gctttcagcc ggataaaaac gaataaccac acggaggtgt gaccatgcat 28200 
gaagcagccc gtgtggatga tcctatctac cacaccagcg cgctcgccgg gtctctcatc 28260 
ggcgctatca tcggcatcgc cattatcgcg ctcgccgcct ttgccttctt tagctgcggt 28320 
ttcctcgccg ggctgattct gggtcttatg gccgatcaaa tagcctccgg ggtactgcaa 28380 
ctgggcgagg ccatcgggcg ctccatccac cacacggcag gaaaaatcct caccggttcg 28440 
gagaatgtca gcaccaacag tcgcccggcg gcgcgcgcgg tactgagtac ggtgaaatgc 28500 
gataaccata tcgcagaaaa acgcatcgcc caagggccgg aaaatatcca catcaacagc 28560 
cagcccgccg cccgtaagga cgaccacacc gaatgcgacg cggtgattga agacggctcg 28620 
ccgaatgcgt ttctcggcgg cggcacacag acggcactgg aaatcagttc tgaaattccg 28680 
gaccggccgc gcaaggtggt ggatgtattg tttgtcgtgg cgagtctgct cggcgggctg 28740 
gccggggcgt ggcggcaggc ggcaaagctg gggacgaaat tcggcactaa atgtgccgct 28800 
aagtttatcg gcggggagct tgtcgggatg gccgtgggtg aggctatcag cgggctgttc 28860 
agcaanccgg tggacgtgac caccgggcag aaaatcctgc tgccggaaac ggacttcacc 28920 
ctgcccggtc gcctigccggt cacctgctcg cgtttttacg ccagccacct ggaaactgtg 28980 
ggaccgttgg gacggggctg gcggctgaac tgggaaacca gcctgcgcga tgacgatgaa 29040 
cacatcacgc tgaccggcgt acaggggcgg gaactgcgtt acccgaaaac gatgctgacg 29100 
cccggccacc agatatttga cccggaagaa cagttatacc tcagccgcct gcatgacggg 29160 
cgttacgtgc cgcactacac cgatcgcagc cattacgcat ttggtgattt tgacagtgac 29220 
ggcatggcat acctgctgtt tatggagacg ccgcaccgcc agcgcattgt cctcgggcac 29280 
gaaggaggca gactggtacg gatagcctcc agcagcgggc atcacctgtt actgcaccgc 29340 
acacagaccc cggcagggga gcggctgtcg cgaattgaac tggtgcaggg cggcacccgt 29400 
ggcaatctgg tggagtaccg gtatgacgat aacggtcaac tgaccggcgt ggtgaaccgg 29460 
gcgggaacgc aggtgcgtca gtttgcttat gaaaacgggc tgatgacggc gcacagcaat 29520 
gcgacggggt tcacctgccg ctaccgctgg caggaactcg acggcgcgcc gcgcgtgacg 29580 
gagcacgaca ccagtgacgg cgaacattac cgctttgact atgattttgc cgcaggcacc 29640 
accaccgtca ccggcaggca gggggagaca tggcagtggt ggtacgacag ggaaacgtat 29700 
atcaccgcgc accggacgcc gggcggtgga atgtaccgct tcacgtacaa cgaagaccac 29760 
ttccctgtca acattgagct gcccggcggt cgcacggtgg cgtatgaata tgacatccag 29820 
aaccgggtgg tgaagacgac agatccggaa ggccgggtga cgcagacgca gtggaacggc 29880 
gagctcgacg aaatcacgcg cacggcgctg gacgatgacg ctgtctggaa aacgcagcac 29940 
aacgcccacg gccagccagt gcaggagacg gacccggaag ggcgggtgac gcagtacgct 30000 
tacgatgaac aggggcagat gtgcagccgg acggatgcgg cgggcggcac gcagggcggc 30060 
gtgcggcggg agacgcagca gcgggatgcg ctgggccgtc tgttacggac ggagaatgaa 30120 
cacagccagc ggacgtccag ctacaaccgg ctggaccaga taacggcagt gacgctcacg 30180 
cccacggagg cggggcaaca gcagcaccgg atgcaggccg acacggtgcg ttttgagtat 30240 
gaccgcagcg gctggctgac ggcggagcac gcggggaacg gtagcatatg ttatcagcgc 30300 
gacgcgctgg gcaacccgac ggacatcacg ctgccggacg ggcagcacct gacgcacctg 30360 
tatracggga gcgggcarct gttacagacg gcgctggacg gcctgacggc gagcgagtat 30420 
gagcgcgaca gcctgcaccg ccagataatg cgcacgcagg ggcagctngc gacgcacagc 30480 
ggccacgacg acgacgggcr. gctgagctgg cagcgcagtc tggcgtccgg cagcgccccc 30540 
gttcrtcctg gccagcgccc ggcgcggcag ggctgcgtga cgtcgaggga ctattaccgg 30600 
aacaaccacg gcgaggtggg cacgattgac gacggcctgc gtggcagcgt ggtgtacagc 30660 
tacgacagaa gcggttacct gaccgggcgc tcaggtcaga tgtatgacca tgaccgttat 30720 
taczacgata aggcgggcaa cctgctggat: aacgaagggc agggagcggt gatgagcaac 30780 
cggccgccgg gctgtggtcg tgaccgttac ggctataacg agtggggcga gctgaccacg 30840 
cggcgcgacc agcaactgga gtggaacgcg caggggcagc tgacgcgggn catcagcggc 30900 
aacacggaga cgcactacgg ctacgatgcg ctggggaggc gaacccgcaa ggcgacgtac 30960 
gggcggcaca cgggccatac ggcgcggagc cggacggact ttgtgtggga ggggctcagg 31020 
ccgrcgcagg agaacgtgca gcagcagggc tggcggacct atctgtacga tgcggaacag 31080 
ccgtacacgc cggtggcgag cgtgacgggg cggggagaaa gcaggcaggt gtggtattac 31140 
cacacggatg tgacgggcac gccgcaggag gtgacggcgg cggacggaac gctggtgtgg 31200 
gcggggtaca tcagggggtt tggagagaat gcggcggaca tcagcaacag cggggcgtac 31260 
tcccaccagc cgctgcggcc gccggggcag tattttgacg acgagacagg gctgcatcac 31320 
aatctgttca gatattatgc accggagtgt ggacggtttg tcagtcagga tccgatcggg 31380 
ctgaggggcg ggttaaacct: ttatcagtat gcgccaaatc ctctcaaata tatagaccca 31440 
ctcggtttaa ccgcgactgt tgggcgatgg atggggcctg cggaatatca gcaaatgctt: 31500 
gacactggga cagtagcaca aagttcaaca gggacaactc atgctgccta ccctgccgan 31560 
atagatgctt ttggtaagca agcaaaaaac ggtgctatgt atgttgaatt tgatgtgcct: 31620 
gaaaaatcat tagtacctac aaatgaagga cgggcaaaaa cagcagggcc agattccatc 31680 
gaagggcgat cagctaaacg caaaggctcg cctgtccctg aaatgccaac agcagaaaac 31740 
acaactgtaa ggggcgagaa aattaatggg gaagttgaag caaaatgcta aataaactta 31800 
aattgtgggt gagcaaacat actgattata cggtaattca taatgaaaat gatttatctt 31860 
acagtattat tatagatttt gaagatgacc ggtatatacc aagatttact gtatgggatg 31920 
acctaagctg tatgtcagaa gtaatggatg cggatactgg cttatataaa ttaaacaaga 31980 
gaaacgaatt ttctacattt gatgaactcc tggatatatt tgatgattznt atgataagta 32040 
ttaaacaata gttggccggg taagaagtta actcttcccg gctgttttat tatctaaccc 32100 
ccatcaatcc ggagacgcgc taccggtacg atgcgctggg caggcgggtg agcaaggcga 32160 
cgtacgggcg gcacacgggc catacggcgc ggagccggac ggactttgtg tgggaggggt 32220 
tcaggctgtt gcaggagaac gtgcagcagc agggctggcg gacatatctg tacgatgcgg 32280 
aacagccgca cacgccggtg gcgagcgcga cgggaaaggg agaaagcagg caggtgtggc 32340 
attaccacac ggacgtgacg ggcacgccgc aggaggtgac ggcggcggac ggaacgctgg 32400 
tgtgggcggg gtatatcagg gggttcggag aaaatgcggc ggacatcagc aacagcgggg 32460 
cgtacCCtca ccagccgctg cggccgccgg ggcagtattt tgacgacgag acagggctgc 32520 
attacaatct gttcagatat tatgcaccgg agtgtggacg gttcgtcagt caggacccga 32580 
ttgggctggc ggsgggggct gaatctttac cagtatgcgc ctaatccgat tagatggatc 32640 
gatcctctag gacttgctat cctggagcat caat.ct.aatc ttgatgcggc aaggagaacc 32700 
ggatttgaaa atgcgggtat gacaaaccct gaggatgtca ctttctcgaa agtcgatccc 32760 
aaaaccggta ccgttgttga gtttaaaggt ccaaatgggg ctaaagttgc ttatgatgca 32820 
cctcatgcag atatggatgt gacagcaggg catgataaac cacatgttgg ttggcaaccc 32880 
qcaggaaaaa gaggttccgg aggagctaat agaggtaata Ctacctacga tggcccacaa 32940 
catccgcatc gctctgactc taagggagat gataaatgtt aaattcaaat acgtctgaac 33000 
ttagaatcga actggagaat gcgattaaaa atctcggtat tcatgattat cgtgtcgata 33060 
aacccgaaca aatcgtttct gagataaaag agatatatgt taatggtaat cctagaacct 33120 
ggtggttatc attaaaacat agacaatatg tcttttctta taccgataat tctggatata 33180 
aaaacaLaLu acaaataota aataaacaac tcaatqaaaq caatgtaatc aacaaacata 33240 
tatttttgat tgctgatgaa gataatgagc aaatatatgt atataacgtt cctctcaact 33300 
ccctgcctga aattatagaa aattgcagat attttgaata ttatgttgca gatcacgaac 33360 
tatctcggct tatatgtgaa aacgatcatg gtgatttgat tgtatgccca accactaagt 33420 
aaagcgcgag tgctctttag cgatatagtt gcccatattt aggcgttact agccgaagat 33480 
ggcgcgattg tctggcaggg gaaacagcaa ttctgaggtc aggaagatag cacaacccat 33540 
taaccgggat agatccgcta gacctgaatc cagttgatgc gacaggttat agggtttatg 33600 
gttacttgct cctggagcaa ataaacctta ttacattggt attactaacg atatggtttg 33660 
acgaagggcc gagcattaaa gcactggcag gttatcaaaa gaaaatggaa ggatgctgcc 33720 
atttgacgaa aacgtaatcc aatggaaagt cagaggttac gagaaatatt acatagagaa 33780 
atataaaacc agaaccggaa ccacaggtga aaaaattccc tcaacaaata gagagaataa 33840 
atataattca tttgatcatg ggcgaacaga tcccagcgca caagcattca aagactctta 33900 
aaatagtaag ggagttggtt ccggtggagg aaaatgcgga tgagtgatta agaattttgg 33960 
ggctgtgata agaagtcgag aacaatgctg cgttttgtga agcccggaga catattttgt 34020 
tttaaattag atgaagatag atattgtttt gggcgaatta taacactaat gactgtcgga 34080 
catctctccg aattatttga tataatcaaa aaaccccctg gaataacaga gccagaaatt 34140 
agtaacgcaa ggcgaatcat tgaaccaatt atagtggata catattctrt atttgataag 34200 
aaattagaaa acggaagtga ccggagaatt attggtcatc aggttaatta caatccaaaa 34260 
aattcagatg gtatctattt tgcactcgga acaggtgatt cctgtaaaaa gaaagactgt 34320 
tacggaaacg atttcctcat ttcagaaagt gagtggaaaa cacttcctaa attatctcct 34380 
aaagggggtt. tcgatatcaa aaaacggctt gaaattgcct gaaaatgaaa ataaaaagcc 34440 
gggaaagatc ttttgtcttc ccggatttta ttatttaatc cccgttcacc acattattta 34500 
cccccgcctt aatatgcttc atcgactttn Ccacctgata aagctccttc cgtagatccc 34560 
tcacttcgtc cgtctctgca atcaggatca aacacccctc ggagatctcc acggtgacgc 34620 
cgtgcccggt ctcaaatccc gcttcttcct gccagtcacc cttaacgtgc tggctgggga 34680 
cttgngagna acaggcggtc atgcaggtct cgcngctgaT: atggcggacg ccgacgccgg 34740 
aggcattcat atttgctgac taaataaatt cttatttatc cgccggatgc tggctgattg 34800 
tggagcccag ggtgagtgag tatgggcgcg acatcctggc accgcgcctc cctctccccg 34860 
gccagccccg gccggtgatg agcagccggc tgccggggcc gtattttgac gatgaaacgg 34920 
gcctgcatca aaatctgttc agatattatg taccggagtg tggctggtcc gtcagtcagg 34980 
atccaatagg gctgaaaggg ggatggaacc gatatcattc tccgctgaat cctatcacag 35040 
atagtgatcc tcttggcctt attacttgtg gtgctgatag aggtgattct ggcaagttat 35100 
taagacgaaa aatggctgaa aataattgcg ttagtgaatg ctcatttacg cctctaccta 35160 
caaaaagcaa cggatttgct tgctggaatt gtgtcaacga atgcaaataa aatcaatgct 35220 
gttgacgttt ttactacagg aataagtatg gttaacgata aggatacagc tatattaatt 35280 
agtaatttaa tgttgagatt cggcaaggag cttgatgaat ctgttgctgt: tgttcagtcc 35340 
cgct:gt:gat - g aggatgaatt taatgtatac cgagaaacgg tcggttttat cacgggcgaa 35400 
acgcttatta aaataatgaa cccattacat gaaaaacatc cagaaataaa accaaaagga 35460 
ttgaaacaaa acatctggaa ccggatgaat aatgtgtaaa agccggaggg gttatctttt 35520 
cccggctttt tattatcaat tactcattaa ctcctgttcc gttctcttgc gtttaatcac 35580 
cggaatatct ccggtattgt tcagcgcccc ggaaatgttt tcaaccactg tcccgcactc 35640 
cgtttattaa tgcgggttac gcccacccct tcaatacagc caaagagtcc gtgggtatgc 35700 
tgcggcgtga tcacgatgca atccctcatt acccgcacct tgacgggcat cccgttaata 35760 
aaccccgcct gcggcatcca ttcaccgtac aaacgaactg aaagtctctc aacgcgtgag 35820 
tatgtaagta tcccgcataa tcgagccatt cacatttaga gatcatccga cacaatcaat 35880 
ctgccaacgc aggagatcgc tatgcgtaaa gcccgtatta ctgcgcacca gatcatcgct 35940 
gcgattagat cagtcgaacc cggacggacr gttaaagatg tctaccggga gcccggtact 36000 
tctgaagcca ccagggacaa ctggaggtct ggacacggcg gcagggacac gcgtggaatc 36060 
acaaaaggct ccaccgtatt tactgtctgc tcaagctgaa ttttcgccgt aagggtaaac 36120 
aacggctgcc ggtacgcaat ccctcgccac tggtcacgcc ggaagcgctg aaccagagct 36180 
ggtctgtggg cgtcgttttc gcacgttcaa tgctgttgat gactgtaacc gcgaagcgtt 36240 
gtcgactgaa atcgatctga atctgccagc tctgcgagtg gtccgtgtac tcgacaggat 36300 
tacagcaacc gcggttatct ggccatgctg cgtatggata agggaccgga atttatctcg 36360 
ctggcactgg ctgaatgggc aaagaaacat ccagtaaagc tggcgtttat ccagccgggt 36420 
aagccgaaga aaaacgtttt catcacgcgc tttaaccgga cataccgtac agaaataccc 36480 
aattcttatc tgttcagaac gctgaatgag gtgtgggaaa ttacggataa agggttatca 36540 
gaatataact gcgaacgtcc acatgaatcg cggaacaata tgataccgaa ggaataccgc 36600 
caataacgtt atctggccgg aatcttaaaa atgcatggaa ctaaaacggg tctatttaca 36660 
ggggcacctg cgatgaattt cgctgcactg aaaagcgata ccggatgaga gctgcttcaa 36720 
attaatgtgc catgttcacg gggaggttgt gcgacgtttg cataatccag caagaactga 36780 
aaggaagggg agagcttttt catgcctgta taatcagtct ggcctgtgtc agtcagctct 36840 
tagtgttgag accctcgttg gagcgttata attgcttttc tgtttcggaa aacaagattt 36900 
tccattaaag atcttccctg cgaggaaaag ttaactaata atcttaccgt cgagttagga 36960 
gatgtatgtt taaatataaa caatgttgca acgatgcctg ataattatcc tctcttcgaa 37020 
gacaagtttc ccacacccag tgtagtaggt gtcatggtaa tgttatcact tgaatgtaaa 37080 
tggaaggtat aattgctttt tgactggcat tctattccac cctgacaaca cgatgttaac 37140 
atcaacactg tttatatcgg caataacgca atttttttca gattaagagg tgctctgata 37200 
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37376 



tatagacttt tatgacatta cttatttgaa ttggtaacaa ataccaataa gtacaagctg 37260 

ttattaccag ccacggattt tttacatacg gtaagatttg gtatggcgtt atgtattctg 37320 

aacgtgctgg atcattttaa tttggtttaa aaaaggtggt tattcaa atg aaa agc 37376
Met Lys Ser
1

ata aaa aaa ttg att atc gca agt gcg teg agc atg atg gct gct agt 37424
Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met Ala Ala Ser
5 10 15

tgt tat gct ggc tca ttt ttg ccg aac tca gag caa caa aaa tca gtg 37472
Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln Lys Ser Val
20 25 30 35

gat att gtg ttt tcc tct ccc caa gat tta acc gta tcg ctt att cca 37520
Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser Leu Ile Pro
40 45 50

a:q ceg age tta aag gct ggg aaa aat gct cct agc gcg aaa att gcg 37568
Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala Lys Ile Ala
55 60 65

aac ctt gta gtt aat tct act act ctt aaa gaa ttc ggg gtc agg ggg 37616
Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly Val Arg Gly
70 75 80

att tct aac aac gtg gta gac agt act ggc act gca tgg cgt gta gct 37664
Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp Arg Val Ala
85 90 95

ggt aaa aat act ggt aaa gag atc ggt gtg ggc tta tca agt gac agt 37712
Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser Ser Asp Ser
100 105 110 115

c-- aga aga tct gat agc acg gaa aaa tgg aat ggg gtg aac tgg atg 37760
Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val Asn Trp Met
120 125 130

acc ttt aat agc aat gac aca ctt gat att gtc ctg aca gga ccg gcg 37808
Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr Gly Pro Ala
135 140 145

cag aat gtc aca gct gac acg tac cca ata act tta gac gta gtg gga 37856
Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp Val Val Gly
150 155 160

tat caa cct taa tagtaaacaa ctattagtgt attgtgcctt gtttaaggcg 37908
Tyr Gln Pro
165

caatacacat caaatcatct atttttcttt tacaattttt gat atg aaa ata gtt 37963
Met Lys Ile Val
170

aat ttt gct gta atg gcg gta gct ttg ttc gcc act aat tct atg gtt 38011
Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr Asn Ser Met Val
175 180 185
tca gta tat gcc gtc aac cag caa tea aat tca gcc act aaa tta ttc 38059
Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala Thr Lys Leu Phe
190 195 200

agc gtg aag ctg ggg gct aca cga gtg att tat cac gct ggt acg gct 38107
Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His Ala Gly Thr Ala
205 210 215

gga gcg acg ctc tcg gtg agc aac ccg cag aat tac cct att ttg gtt 38155
Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr Pro Ile Leu Val
220 225 230 235

cag tct tca gtc aaa gca gca gac aaa agt tcg cct gct ccc ttt ttg 38203
Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro Ala Pro Phe Leu
240 245 250

gtg atg ccg cct cta ttt cgt tta gaa gca aac cag cag agt caa ctg 38251
Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln Gln Ser Gln Leu
255 260 265

cgt att gtc cgt act ggt ggt gac atg cca acg gat cgt gag act tta 38299
Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp Arg Glu Thr Leu
270 275 280

cag tgg gtc tgt ata aag gcg gta cca ccc gaa aat gaa ccg tcg gat 38347
Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn Glu Pro Ser Asp
285 290 295

aca cag gct aag ggc gcg acc ctt gac ctc aat ttg tcc atc aac gcc 38395
Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu Ser Ile Asn Ala
300 305 310 315

tgt gat aag ctg att ttc cgc ccg gat gcc gtg aag ggg acg ccg gaa 38443
Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys Gly Thr Pro Glu
320 325 330

gat gtt gca gga aat tta aga tgg gtg gag acg ggc aac aaa ctt aag 38491
Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly Asn Lys Leu Lys
335 340 345

gtg gag aac ccc acc ccg ttt tac atg aat tta gcc tct gtc aca gta 38539
Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala Ser Val Thr Val
350 355 360

ggg gga aag ccc att aca ggg ctt gag tat gtc ccc ccc ttt gct gac 38587
Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro Pro Phe Ala Asp
365 370 375

aaa aca cta aat atg cca ggt agt gcc cat ggt gat atc gag tgg aga 38635
Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp Ile Glu Trp Arg
380 385 390 395

gtt att aca gac ttt ggt ggt gaa agt cat ccg ttc cac tac gtt ctt 38683
Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe His Tyr Val Leu
400 405 410



aaa taa atccaggggc ttagcggcag aaa atg aag ttc aaa caa cct gcc ttg 38736
Lys Met Lys Phe Lys Gln Pro Ala Leu
415 420

cta ctg ttc atc gcg gga gtg gtt cat tgc gca aat gcg cac act tac 38784
Leu Leu Phe Ile Ala Gly Val Val His Cys Ala Asn Ala His Thr Tyr
425 430 435

aca ttc gat gca tca atg ttg ggc gat gca gcg aaa ggg gtt gat atg 38832
Thr Phe Asp Ala Ser Met Leu Gly Asp Ala Ala Lys Gly Val Asp Met
440 445 450

tcg ctc tct aac cag ggg tta caa cag cca ggg act tat cgc gtg gac 38880
Ser Leu Phe Asn Gln Gly Leu Gln Gln Pro Gly Thr Tyr Arg Val Asp
455 460 465

gtg atg gtg aat ggt aaa cgt gtc gac acc cgt gat gtg gtg ttc aaa 38928
Val Met Val Asn Gly Lys Arg Val Asp Thr Arg Asp Val Val Phe Lys
470 475 480 485

ttg gaa aag gat ggg caa gga acg cct gtt ctg gct cct tgt ttg acg 38976
Leu Glu Lys Asp Gly Gln Gly Thr Pro Val Leu Ala Pro Cys Leu Thr
490 495 500

gtc agt cag ctt tca cgc tac ggc gta aaa acg gaa gat tac cct cag 39024
Val Ser Gln Leu Ser Arg Tyr Gly Val Lys Thr Glu Asp Tyr Pro Gln
505 510 515

ttg tgg aaa gca gca aag ccc cca gat gag tgt gcg gat ctg acc gcc 39072
Leu Trp Lys Ala Ala Lys Pro Pro Asp Glu Cys Ala Asp Leu Thr Ala
520 525 530

att cca cag gct aaa gcg gta ctg gat atc aat aat cag caa ctg caa 39120
Ile Pro Gln Ala Lys Ala Val Leu Asp Ile Asn Asn Gln Gln Leu Gln
535 540 545

ctg agt att ccg cag ttg gcg ttg cgt ccg gaa ttt aag ggg atc gct 39168
Leu Ser Ile Pro Gln Leu Ala Leu Arg Pro Glu Phe Lys Gly Ile Ala
550 555 560 565

cca gaa gat ctt tgg gat gat ggt att ccg gcg ttt ctg atg aac tac 39216
Pro Glu Asp Leu Trp Asp Asp Gly Ile Pro Ala Phe Leu Met Asn Tyr
570 575 580

agt gcg agg aca acg cag acg gat tac aaa atg gat atg gtg ggg cgt 39264
Ser Ala Arg Thr Thr Gln Thr Asp Tyr Lys Met Asp Met Val Gly Arg
585 590 595

gac aac tct tcc tgg gta caa ctg caa ccg gga atc aat ata ggt gcg 39312
Asp Asn Ser Ser Trp Val Gln Leu Gln Pro Gly Ile Asn Ile Gly Ala
600 605 610

tgg cgt gtc cgc aat gcg acc agc tgg cag cgg agt agt caa ctg tcg 39360
Trp Arg Val Arg Asn Ala Thr Ser Trp Gln Arg Ser Ser Gln Leu Ser
615 620 625

ggg aag tgg cag gca gca tat acc tat gct gag cgt gga ctg tac tca 39408
Gly Lys Trp Gln Ala Ala Tyr Thr Tyr Ala Glu Arg Gly Leu Tyr Ser
630 635 640 645

cta aaa agt cgt ctg act ctg ggg caa aag act tcg cag ggg gag ata 39456
Leu Lys Ser Arg Leu Thr Leu Gly Gln Lys Thr Ser Gln Gly Glu Ile
650 655 660

ttt gat agt gtg cca ttt acc ggt gtg atg ttg gca tcg gat gac aac 39504
Phe Asp Ser Val Pro Phe Thr Gly Val Met Leu Ala Ser Asp Asp Asn
665 670 675

atg gtg ccc tac agt gag cgg cag ttt gct ccg gta gtg cgt ggg att 39552
Met Val Pro Tyr Ser Glu Arg Gln Phe Ala Pro Val Val Arg Gly Ile
680 685 690

gcc cgc acg cag gct cgg gtg gag gtc aaa cag aat ggt tac acc att 39600
Ala Arg Thr Gln Ala Arg Val Glu Val Lys Gln Asn Gly Tyr Thr Ile
695 700 705

tac aac acc act gtg gcg ccc gga ccg ttt gca ctg cgg gat ctg tcg 39648
Tyr Asn Thr Thr Val Ala Pro Gly Pro Phe Ala Leu Arg Asp Leu Ser
710 715 720 725

gta aca gac agt agt ggt gat ctg cat gtc acc gtg tgg gag gcc gat 39696
Val Thr Asp Ser Ser Gly Asp Leu His Val Thr Val Trp Glu Ala Asp
730 735 740

ggc agt aca caa atg tct gtg gtg ccg tat cag acc ccg gcg ata gca 39744
Gly Ser Thr Gln Met Phe Val Val Pro Tyr Gln Thr Pro Ala Ile Ala
745 750 755

ctg cac cag gga tat ttg aag tac agc ctg ttg gcg ggc cga tac cga 39792
Leu His Gln Gly Tyr Leu Lys Tyr Ser Leu Leu Ala Gly Arg Tyr Arg
760 765 770

tcg tca gac tct gca acg gat aag cgg cag atc gcg cag gct acg ttg 39840
Ser Ser Asp Ser Ala Thr Asp Lys Arg Gln Ile Ala Gln Ala Thr Leu
775 780 785

atg tat ggt ctg ccg tgg aat ctc act gca tac ggc ggt ata cag agt 39888
Met Tyr Gly Leu Pro Trp Asn Leu Thr Ala Tyr Gly Gly Ile Gln Ser
790 795 800 805

gca acg cat aat caa gct gca ttg ctt ggt ttg ggg gga tct ctc ggg 39936
Ala Thr His Asn Gln Ala Ala Leu Leu Gly Leu Gly Gly Ser Leu Gly
810 815 820

cgg tgg ggg agt tta tct gtc gat gga agc gac aca cac agt cag cgt 39984
Arg Trp Gly Ser Leu Ser Val Asp Gly Ser Asp Thr His Ser Gln Arg
825 830 835

cag ggg gag gcg gta cag caa gga gcc tcc tgg cga ctg cgt tac agc 40032
Gln Gly Glu Ala Val Gln Gln Gly Ala Ser Trp Arg Leu Arg Tyr Ser
840 845 850

aac cag ctg act gcg acg ggg aca aat ttt ttt ctg acg aga tgg cag 40080
Asn Gln Leu Thr Ala Thr Gly Thr Asn Phe Phe Leu Thr Arg Trp Gln
855 860 865

tat gcc tcg cag ggc tat aac acc cta tcc gat gtg ctc gac agt tat 40128
Tyr Ala Ser Gln Gly Tyr Asn Thr Leu Ser Asp Val Leu Asp Ser Tyr
870 875 880 885

cga cat aat ggc aac cgt cta tgg tcg tgg cgg gaa aat ttg cag ccg 40176
Arg His Asn Gly Asn Arg Leu Trp Ser Trp Arg Glu Asn Leu Gln Pro
890 895 900

agc tcg cgt act acc ctg atg ttg agt cag tca tgg ggg agg cat ttg 40224
Ser Ser Arg Thr Thr Leu Met Leu Ser Gln Ser Trp Gly Arg His Leu
905 910 915
ggc aat ctg agc tea acc ggt tcc cgt acc gac tgg cgt aat cgc ccc 40272
Gly Asn Leu Ser Leu Thr Gly Ser Arg Thr Asp Trp Arg Asn Arg Pro
920 925 930

ggt cat gat gac agc tac gga ctg agt tgg gga acc tct atc gga ggg 40320
Gly His Asp Asp Ser Tyr Gly Leu Ser Trp Gly Thr Ser Ile Gly Gly
935 940 945

ggc tcg ctg tca ttg aac tgg aat caa aac aga acg ctg tgg cgc aat 40368
Gly Ser Leu Ser Leu Asn Trp Asn Gln Asn Arg Thr Leu Trp Arg Asn
950 955 960 965

ggc gcg cac cgt aaa gag aac ata acc agc ctg tgg ttc agt atg cca 40416
Gly Ala His Arg Lys Glu Asn Ile Thr Ser Leu Trp Phe Ser Met Pro
970 975 980

tta agc cgc tgg acg ggg aat aat gta agt gct agt tgg cag atg act 40464
Leu Ser Arg Trp Thr Gly Asn Asn Val Ser Ala Ser Trp Gln Met Thr
985 990 995

tca cca tca cac ggt gac cag acg caa caa gtg ggg gtc aac gga gag 40512
Ser Pro Ser His Gly Gly Gln Thr Gln Gln Val Gly Val Asn Gly Glu
1000 1005 1010

gca ttc agt cag caa ctg gat tgg gag gtg cgt cag agt tac cgt gcc 40560
Ala Phe Ser Gln Gln Leu Asp Trp Glu Val Arg Gln Ser Tyr Arg Ala
1015 1020 1025

gat gcc ccg cca ggt ggt ggt aat aac agc gca ttg cac ttg gca tgg 40608
Asp Ala Pro Pro Gly Gly Gly Asn Asn Ser Ala Leu His Leu Ala Trp
1030 1035 1040 1045

aat ggg gat tac ggc ctg tta ggt ggt gac tat agc tac agc cgg gcg 40656
Asn Gly Asp Tyr Gly Leu Leu Gly Gly Asp Tyr Ser Tyr Ser Arg Ala
1050 1055 1060

ata cgc cag atg gga gtc aat atc gcg gga ggt ata gtt atc cac cat 40704
Met Arg Gln Met Gly Val Asn Ile Ala Gly Gly Ile Val Ile His His
1065 1070 1075

cat ggt gtg acg ctg ggg caa cct ttg caa ggc tca gtg gcg ctg gtt 40752
His Gly Val Thr Leu Gly Gln Pro Leu Gln Gly Ser Val Ala Leu Val
1080 1085 1090

gaa gcg cca ggg gcc tcg ggg gtg cca gtt ggc ggc tgg cct ggc gtt 40800
Glu Ala Pro Gly Ala Ser Gly Val Pro Val Gly Gly Trp Pro Gly Val
1095 1100 1105

aag acg gat ttt cgt ggc gac acc aca gtg ggc aac ctg aac gtc tat 40848
Lys Thr Asp Phe Arg Gly Asp Thr Thr Val Gly Asn Leu Asn Val Tyr
1110 1115 1120 1125

cag gag aat aca gtc agc ctc gat ccg tcg cga cta ccg gat gac gca 40896
Gln Glu Asn Thr Val Ser Leu Asp Pro Ser Arg Leu Pro Asp Asp Ala
1130 1135 1140

gag gtc aca caa acc gat gtg cgc gtg gtg cca acc gaa ggg gcg gtg 40944
Glu Val Thr Gln Thr Asp Val Arg Val Val Pro Thr Glu Gly Ala Val
1145 1150 1155
gcg gaa gcg aag ttc cac act cgc atc ggg gcc agg gca ctg atg acg 40992
Val Glu Ala Lys Phe His Thr Arg Ile Gly Ala Arg Ala Leu Met Thr
1160 1165 1170

ctg aaa cgg gaa gat ggt agc gcc att cct ttc ggg gcg cag gtt aca 41040
Leu Lys Arg Glu Asp Gly Ser Ala Ile Pro Phe Gly Ala Gln Val Thr
1175 1180 1185

gtc aat ggg cag gat ggc agt gct gct ctg gtg gat act gat agc cag 41088
Val Asn Gly Gln Asp Gly Ser Ala Ala Leu Val Asp Thr Asp Ser Gln
1190 1195 1200 1205

gtt tat ctc act ggt ttg gcg gat aag ggc gaa ctg acg gtg aaa tgg 41136
Val Tyr Leu Thr Gly Leu Ala Asp Lys Gly Glu Leu Thr Val Lys Trp
1210 1215 1220

gga gca cag caa tgt cgg gtt aac tac cgc cta cct gcc cac aag gga 41184
Gly Ala Gln Gln Cys Arg Val Asn Tyr Arg Leu Pro Ala His Lys Gly
1225 1230 1235

atc gcg ggc ttg tat caa atg agc ggt ctc tgc aga tag ccgattctga 41233
Ile Ala Gly Leu Tyr Gln Met Ser Gly Leu Cys Arg
1240 1245 1250

aggagagaat a atg tgg atg aaa ata cag cga gtg aaa acg gtt atc tat 41283
Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr
1255 1260

agc gta agc tta ctg gtc gct gcc agt agc ttg gtg ccg ata gcg aac 41331
Ser Val Ser Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn
1265 1270 1275

gcc gca gaa aaa ctt cag aca acg cta cgt gta ggt act tac ttt cgg 41379
Ala Ala Glu Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg
1280 1285 1290 1295

gct ggg cac gtg cca gat ggg atg gtg ctt gcg caa ggc tgg gtg act 41427
Ala Gly His Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr
1300 1305 1310

tat cac ggc agt cac agc ggg ttt cgg gta tgg agc gat gag caa aag 41475
Tyr His Gly Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys
1315 1320 1325

gcg ggt aac acg cct acc gta ttg ctg ctg agc ggg caa cag gat cct 41523
Ala Gly Asn Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro
1330 1335 1340

cgc cat cac att cag gtt cgc ctg gag ggc gag ggg tgg caa cca gat 41571
Arg His His Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp
1345 1350 1355

acg gtg agt ggt cgt ggc gcc att tta aga acc gct gca gat aac gcc 41619
Thr Val Ser Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala
1360 1365 1370 1375

agt ttc agt gtg gtc gtt gat ggc aat cag gaa gtg cct gcg gac acc 41667
Ser Phe Ser Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr
1380 1385 1390

tgg acg ctg gat ttt aag gcc tgt gca ttg gcg cag gag gat acg tag 41715
Trp Thr Leu Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
1395 1400 1405



ccgtctgttc cactcatact tcctgtatca gtgaataagg cgaatactgt atatcaggtt 41775
cgaggatgta ttgaaaatct actcaaggca aatatggtta gcaatcttca ttgcagaaaa 41835
atcaattgct gattgatttt aaaatctgaa attcatcttt tttgtaggga gaggaatatt 41895
atgtttcgta aaataattta acggacactg atgcctttat tatttctgtt ttgtggattt 41955
gtttgattca ctcttttgtg agcggggctt tataagcccc tggtataagg tctgtttatc 42015
tgtcaggtat tcatatgtga tatttaagat tttatgttta gggggctttt tcttgtcccc 42075
tcaggcgtaa aaataattta ttttttacat aaaggaataa agcatatgtc ttatgcacga 42135
catttaccgg tattaatgta tcaccatgtc agtgataaac ccggacagat aaccttatct 42195
ccccgtacgt cccgggcgca gatgaaatgg ctggccgaat ctggctggaa aaccgttact 42255
gcctgcagag gnggaagctt tttatcatgg tgcaagattg cctcgtaaaa gcgtcatgct 42315
gacctttgat ggcggctggc tggataactg gttgcaggtt tttccggcgc tgcaggagtt 42375
taatctgcat gcgcatctct ttcttgtgac cagtttgatc agtgacggac cggtccgtat 42435
tcctgcaggc gaaccggtgt actctcatga tgagtgtcaa atgctggtta aacaaggccg 42495
ggctgatgag gtcatgctgc gctggtcaga ggtccgggag atgcacctca gtggccttgt 42555
tgagtttcac tcgcacacgc acacccaccg acgctgggac cagaagcctg tgtcccgtaa 42615
tccgtcggat ttgcttcgtg tcgatattct tcttagtcgt aagcggatga gggagatgct 42675
gggttattgc agtcagcatc tgtgctggcc tgagggctgg tattgttctg actatattca 42735
tgtggctgaa gagttggggt tcacataccc gtataccaca gaaaggcgca tgaacaaccc 42795


agtcatcggt tcacagcgta ttggtcgtat caacgcaaag gagcgaaaga [illegible] 42855
gctgaaacgt cgtctgtttt atcacaccac gcccggattt tctccgccgc [illegible] 42915
taagggggca cgtcggatag ctgactgagc cagagaccag gacgaaagug [illegible] 42975
aatcctcccc catcatcggc gggcaggggt tgtaggccat [illegible] [illegible] 43035


ttttgagcag aaataagata acaccggagg ccgtgataga cccggttgcg cagaatctgg 43095
aggcagcagg tcactggcgc agggcttcta cccgctggct tttggttatg ggagattttg 43155
aatgtacaga ggcccagcgg gagtggttgt tgttgcgccg gaattattgc cttgcgcaga 43215
tatcttcccc tggcgccagt aaagctggat atcagcgacg tgacgaaatg agaggagcct 43275
gcacataggt ttgtgttttg agtaagtgtc ttatttaaac cgtctgttct gtctcctccg 43335
ctttcacaaa taatgtcgag ccgggtgggg gactcaagta agaataatct ggcgatgttt 43395
tgcttgtttc cacgggatac tttgttaggt gaacgataca attaatgcgc tccagactgc 43455
gccacagaaa aggagaacgc ctttgcagaa cgggcgcacg acaatgaacc actttattat 43515
tggtatctgg ttcggcgaca accttgacga tcacgtctga gatagggtat ttacactgag 43575
tggtaaacag gttcaattag taaccggaga tggatgcaaa atcatgatcg attcagatgc 43635
tattttgcag ccaatagatt ttttattaag atgatatcaa ggacattgag gcacacaatg 43695
acgtatcagt aagtcgttga tagctcattt gatacaagaa tttcttttat caacggaaga 43755
taatgatgga actgatcaat aatcgtggta tgcgagaccg gatgatattt attaaagtgg 43815
cggaagtagg gaatctttcc cgggctgcgc gggaattaga tattagcatt tctgctgtca 43875
gtaaatcgct tagtcgcctt gagaattcta ttgaggttac tttacttcgg cgcgattcac 43935
atcacttaga actgactgga gctggtcaga cagcctacgc aagcatgaaa aggataacat 43995
cctccttcca gtccttgctg gatgaattgc gaaatccgga taaaattatc agagggagna 44055
caaaattttc ggctccggct attgtctgtg agtttcttgc caataagtgg atatgggaat 44115
ttacagctag ctatccggat acaaaaatct acctggatcc acgagagcgt agcgattctt 44175
ttagtaaatc cctggagttt gatgagctgg tttttaaaag tggcataatc gaaagtgagg 44235
atctcgtgta tcgaaagata agccctttaa agccggttct ttgtgcgagt ccgaaatata 44295
tcagaaaata tggcaggatc tcacaccctg gcgatctgga aaatcacatt attgtgggcc 44355
ttcacaacca tggtctttcc ggacctctta ctcttttccg tcaggatgaa tcatacacta 44415
ttagtggcgc tgttaatgtt catttatctt ccaataatct tttgagtgtt cttaatttgg 44475
ttttagaagg aaagggtatc aacctcatga ctccggcctg gcttgccacc aaatacttaa 44535
aaaataatga acttgaaatt atacttcctg aacggagggt tccagatctc cccatttatc 44595
ttgcatggcg tcatcgtcag tattattctc ctttatttca acgctttctg tcttttatcg 44655
aagataaatg gaataatcgc ccacaaattg actttctgaa tgatgactaa cccgtttgga 44715
atggtttcga tacgttcctg acttaaaccc acatgatgac tgaattgagg catcgagaca 44775
tgcgactggt cagccagtcg tcttttgacg atgcccaata caaaacatga tgccgactga 44835
cggaaatgat aatacgcgga aacaggacgg ggctgttfctt gggcagccgg aagttaagcc 44895
cataccagaa acgttgcagt gtactgaaaa atggcgccag gttgcacctg ttcaaagatt 44955
ttctgaaggc gcaggagtat tcattactga tatcctccat tgcgccttcg ggaacccaca 45015
ggaccagcta ttttaccgat agtgtttaaa aggcgtaagt aatgccgagc atgaagtcat 45075
tggaggcagc ctttgtgtct gcatcataag cggtatgttc atcaccagca tagcgatttt 45135
ttgaaatgct tactttgcca gcattaatgc acttataact ggcgtcaatc ataacattat 45195
ctgtcacagc atattttgca ccgatacctg cgccccaggc aaagttattt ttrtgaagcag 45255
acagagtttc attaatacca aaaccaacag gaatggtgnt attactcagc ttcacatgag 45315
cgaggccaac gcctgcgctg atatagggag taaatgccgt actattgtga aaatcataat 45375
agccattaac catgtaagtg gtcattcgga cctgattttt tacatttatg tgtactggat 45435
caccaaatgc aataatatcc tgcccgcctt tagcatccgt ctcacctctg aaagtggtat 45495
ccagttctaa acgtactgga agctggaatg gatcataaaa gtcataaccg atagcaaccc 45555
cgccgccaaa aacgcctttg gtacggtcag gtaacgttgc atgaccatta actatctcat 45615
cctggctgaa ggttgagttg attccataga cattgactac ggatgtcccc gctttcccgg 45675
tgatatagat cccttctttt gctgatgcag tagcggacca ggctaccaca aggggaatga 45735
tgcagactgc gaaaaagttt ttcatttcag aacctgcctt aatattgggc taaaagacaa 45795
gtttcacggt atagggtgtg atataacgat tacataaacg aagcccaaaa aacggtctat 45855
tgtaacgctg ggttttctgt aagcgggtaa aaaatgagat gaagatttta aataacaata 45915
cgataatcgt cggtatggaa atccatctcc tcgccaaatt gccccacgta cggtttcact 45975
tctacgttat gtaacgggta gtgtgagatg gagcgatgct gtaagaaaaa gatgaagatg 46035
aatttgtacc cgacctggat aaagcccgtt atcccggaat aacgggcaaa aatatttact 46095
caagtgcctg ggcgagatct tgttgtacct gttgacgctg ttctggtgtt aagactttgc 46155
ttaaatcaaa ataatattta acccgataat agcgagcctg ttgttctatg ttactgaagg 46215
ctgcaagctg ctgttttacg gcggcgtcat cccatttacc ggatttaatc acctctatca 46275
gcgcaccgtc tttaattccc ttcatagaaa tctgactgac gtcggtttcc agttgttggt 46335
gaagtttttt gatccgggta atctgatcgt ttgtcagctt cagatgctgg acaataggat 46395
cctgggcggg caggggagga t-ggggacag cggtggcgaa agcgccaaaa gaaacgcccg 46455
ccagagtcgc tgccagtaaa gttgtgcgta caaagttttt catgaagata tcctgataag 46515
ggagtgatta accgttttta ttacccacga atggcgagca attatcttag agcctatccc 46575
agtagggcta ttttacttgc cattttggac ctgggcagtg ctcgccaaaa cgcgttagcg 46635
ttttgaacgc cgctagcggc ggcccgaagg gcgagcgtag cgagtcaaac ctcacgtact 46695
acgtgtacgc tccggttttt gcgcgctgtc cgtgtccaaa ctggctgcgc caataacgcc 46755
tggtgggata ggctcttagt cagaatacgt tgcctgccac attacgccac gcgaatttgt 46815
tttacggaga gttacggagt gaaacaatcc cgccgcggtg agcggcaggt tgctt 46870



<210> 2 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium 
<400> 2 

Met Lys Ser Ile Lys Lys Leu Ile Ile Ala Ser Ala Leu Ser Met Met
1 5 10 15

Ala Ala Ser Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gln Gln
20 25 30

Lys Ser Val Asp Ile Val Phe Ser Ser Pro Gln Asp Leu Thr Val Ser
35 40 45

Leu Ile Pro Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala
50 55 60

Lys Ile Ala Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly
65 70 75 80

Val Arg Gly Ile Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp
85 90 95

Arg Val Ala Gly Lys Asn Thr Gly Lys Glu Ile Gly Val Gly Leu Ser
100 105 110

Ser Asp Ser Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val
115 120 125

Asn Trp Met Thr Phe Asn Ser Asn Asp Thr Leu Asp Ile Val Leu Thr
130 135 140

Gly Pro Ala Gln Asn Val Thr Ala Asp Thr Tyr Pro Ile Thr Leu Asp
145 150 155 160

Val Val Gly Tyr Gln Pro
165



<210> 3 

<211> 245

<212> PRT 

<213> Salmonella typhimurium 

<400> 3 

Met Lys Ile Val Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr
1 5 10 15

Asn Ser Met Val Ser Val Tyr Ala Val Asn Gln Gln Leu Asn Ser Ala
20 25 30

Thr Lys Leu Phe Ser Val Lys Leu Gly Ala Thr Arg Val Ile Tyr His
35 40 45

Ala Gly Thr Ala Gly Ala Thr Leu Ser Val Ser Asn Pro Gln Asn Tyr
50 55 60

Pro Ile Leu Val Gln Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro
65 70 75 80

Ala Pro Phe Leu Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gln
85 90 95

Gln Ser Gln Leu Arg Ile Val Arg Thr Gly Gly Asp Met Pro Thr Asp
100 105 110

Arg Glu Thr Leu Gln Trp Val Cys Ile Lys Ala Val Pro Pro Glu Asn
115 120 125

Glu Pro Ser Asp Thr Gln Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu
130 135 140

Ser Ile Asn Ala Cys Asp Lys Leu Ile Phe Arg Pro Asp Ala Val Lys
145 150 155 160

Gly Thr Pro Glu Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly
165 170 175

Asn Lys Leu Lys Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala
180 185 190

Ser Val Thr Val Gly Gly Lys Pro Ile Thr Gly Leu Glu Tyr Val Pro
195 200 205

Pro Phe Ala Asp Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp
210 215 220

Ile Glu Trp Arg Val Ile Thr Asp Phe Gly Gly Glu Ser His Pro Phe
225 230 235 240

His Tyr Val Leu Lys
245



<210> 4 

<211> 836 

<212> PRT 

<213> Salmonella typhimurium 



<400> 4 

Met Lys Phe Lys Gln Pro Ala Leu Leu Leu Phe Ile Ala Gly Val Val
1 5 10 15

His Cys Ala Asn Ala His Thr Tyr Thr Phe Asp Ala Ser Met Leu Gly
20 25 30

Asp Ala Ala Lys Gly Val Asp Met Ser Leu Phe Asn Gln Gly Leu Gln
35 40 45

Gln Pro Gly Thr Tyr Arg Val Asp Val Met Val Asn Gly Lys Arg Val
50 55 60

Asp Thr Arg Asp Val Val Phe Lys Leu Glu Lys Asp Gly Gln Gly Thr
65 70 75 80

Pro Val Leu Ala Pro Cys Leu Thr Val Ser Gln Leu Ser Arg Tyr Gly
85 90 95

Val Lys Thr Glu Asp Tyr Pro Gln Leu Trp Lys Ala Ala Lys Pro Pro
100 105 110

Asp Glu Cys Ala Asp Leu Thr Ala Ile Pro Gln Ala Lys Ala Val Leu
115 120 125

Asp Ile Asn Asn Gln Gln Leu Gln Leu Ser Ile Pro Gln Leu Ala Leu
130 135 140

Arg Pro Glu Phe Lys Gly Ile Ala Pro Glu Asp Leu Trp Asp Asp Gly
145 150 155 160

Ile Pro Ala Phe Leu Met Asn Tyr Ser Ala Arg Thr Thr Gln Thr Asp
165 170 175

Tyr Lys Met Asp Met Val Gly Arg Asp Asn Ser Ser Trp Val Gln Leu
180 185 190

Gln Pro Gly Ile Asn Ile Gly Ala Trp Arg Val Arg Asn Ala Thr Ser
195 200 205

Trp Gln Arg Ser Ser Gln Leu Ser Gly Lys Trp Gln Ala Ala Tyr Thr
210 215 220

Tyr Ala Glu Arg Gly Leu Tyr Ser Leu Lys Ser Arg Leu Thr Leu Gly
225 230 235 240

Gln Lys Thr Ser Gln Gly Glu Ile Phe Asp Ser Val Pro Phe Thr Gly
245 250 255

Val Met Leu Ala Ser Asp Asp Asn Met Val Pro Tyr Ser Glu Arg Gln
260 265 270

Phe Ala Pro Val Val Arg Gly Ile Ala Arg Thr Gln Ala Arg Val Glu
275 280 285

Val Lys Gln Asn Gly Tyr Thr Ile Tyr Asn Thr Thr Val Ala Pro Gly
290 295 300

Pro Phe Ala Leu Arg Asp Leu Ser Val Thr Asp Ser Ser Gly Asp Leu
305 310 315 320

His Val Thr Val Trp Glu Ala Asp Gly Ser Thr Gln Met Phe Val Val
325 330 335

Pro Tyr Gln Thr Pro Ala Ile Ala Leu His Gln Gly Tyr Leu Lys Tyr
340 345 350

Ser Leu Leu Ala Gly Arg Tyr Arg Ser Ser Asp Ser Ala Thr Asp Lys
355 360 365

Arg Gln Ile Ala Gln Ala Thr Leu Met Tyr Gly Leu Pro Trp Asn Leu
370 375 380

Thr Ala Tyr Gly Gly Ile Gln Ser Ala Thr His Asn Gln Ala Ala Leu
385 390 395 400

Leu Gly Leu Gly Gly Ser Leu Gly Arg Trp Gly Ser Leu Ser Val Asp
405 410 415

Gly Ser Asp Thr His Ser Gln Arg Gln Gly Glu Ala Val Gln Gln Gly
420 425 430

Ala Ser Trp Arg Leu Arg Tyr Ser Asn Gln Leu Thr Ala Thr Gly Thr
435 440 445

Asn Phe Phe Leu Thr Arg Trp Gln Tyr Ala Ser Gln Gly Tyr Asn Thr
450 455 460

Leu Ser Asp Val Leu Asp Ser Tyr Arg His Asn Gly Asn Arg Leu Trp
465 470 475 480

Ser Trp Arg Glu Asn Leu Gln Pro Ser Ser Arg Thr Thr Leu Met Leu
485 490 495

Ser Gln Ser Trp Gly Arg His Leu Gly Asn Leu Ser Leu Thr Gly Ser
500 505 510

Arg Thr Asp Trp Arg Asn Arg Pro Gly His Asp Asp Ser Tyr Gly Leu
515 520 525

Ser Trp Gly Thr Ser Ile Gly Gly Gly Ser Leu Ser Leu Asn Trp Asn
530 535 540

Gln Asn Arg Thr Leu Trp Arg Asn Gly Ala His Arg Lys Glu Asn Ile
545 550 555 560

Thr Ser Leu Trp Phe Ser Met Pro Leu Ser Arg Trp Thr Gly Asn Asn
565 570 575

Val Ser Ala Ser Trp Gln Met Thr Ser Pro Ser His Gly Gly Gln Thr
580 585 590

Gln Gln Val Gly Val Asn Gly Glu Ala Phe Ser Gln Gln Leu Asp Trp
595 600 605

Glu Val Arg Gln Ser Tyr Arg Ala Asp Ala Pro Pro Gly Gly Gly Asn
610 615 620

Asn Ser Ala Leu His Leu Ala Trp Asn Gly Asp Tyr Gly Leu Leu Gly
625 630 635 640

Gly Asp Tyr Ser Tyr Ser Arg Ala Met Arg Gln Met Gly Val Asn Ile
645 650 655

Ala Gly Gly Ile Val Ile His His His Gly Val Thr Leu Gly Gln Pro
660 665 670

Leu Gln Gly Ser Val Ala Leu Val Glu Ala Pro Gly Ala Ser Gly Val
675 680 685

Pro Val Gly Gly Trp Pro Gly Val Lys Thr Asp Phe Arg Gly Asp Thr
690 695 700

Thr Val Gly Asn Leu Asn Val Tyr Gln Glu Asn Thr Val Ser Leu Asp
705 710 715 720

Pro Ser Arg Leu Pro Asp Asp Ala Glu Val Thr Gln Thr Asp Val Arg
725 730 735

Val Val Pro Thr Glu Gly Ala Val Val Glu Ala Lys Phe His Thr Arg
740 745 750

Ile Gly Ala Arg Ala Leu Met Thr Leu Lys Arg Glu Asp Gly Ser Ala
755 760 765

Ile Pro Phe Gly Ala Gln Val Thr Val Asn Gly Gln Asp Gly Ser Ala
770 775 780

Ala Leu Val Asp Thr Asp Ser Gln Val Tyr Leu Thr Gly Leu Ala Asp
785 790 795 800

Lys Gly Glu Leu Thr Val Lys Trp Gly Ala Gln Gln Cys Arg Val Asn
805 810 815

Tyr Arg Leu Pro Ala His Lys Gly Ile Ala Gly Leu Tyr Gln Met Ser
820 825 830

Gly Leu Cys Arg
835



<210> 5 

<211> 156 

<212> PRT 

<213> Salmonella typhimurium 



<400> 5 

Met Trp Met Lys Ile Gln Arg Val Lys Thr Val Ile Tyr Ser Val Ser
1 5 10 15

Leu Leu Val Ala Ala Ser Ser Leu Val Pro Ile Ala Asn Ala Ala Glu
20 25 30

Lys Leu Gln Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg Ala Gly His
35 40 45

Val Pro Asp Gly Met Val Leu Ala Gln Gly Trp Val Thr Tyr His Gly
50 55 60

Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gln Lys Ala Gly Asn
65 70 75 80

Thr Pro Thr Val Leu Leu Leu Ser Gly Gln Gln Asp Pro Arg His His
85 90 95

Ile Gln Val Arg Leu Glu Gly Glu Gly Trp Gln Pro Asp Thr Val Ser
100 105 110

Gly Arg Gly Ala Ile Leu Arg Thr Ala Ala Asp Asn Ala Ser Phe Ser
115 120 125

Val Val Val Asp Gly Asn Gln Glu Val Pro Ala Asp Thr Trp Thr Leu
130 135 140

Asp Phe Lys Ala Cys Ala Leu Ala Gln Glu Asp Thr
145 150 155
SEQUENCE LISTING NO. 2

<110> Folkesson, Anders

<120> The complete sequence of the tcf insert of Salmonella
enterica serovar Typhi.

<130> The tcf insert in Salmonella typhi

<140>
<141>

<160> 6

<170> PatentIn Ver. 2.1

<210> 1
<211> 9253
<212> DNA

<213> Salmonella typhi

<220>
<221> CDS
<222> (1898)..(2608)
<223> tcfA putative fimbrial subunit

<220>
<221> CDS
<222> (2659)..(3234)
<223> tcfB putative fimbrial subunit

<220>
<221> CDS
<222> (3360)..(6029)
<223> tcfC putative fimbrial subunit

<220>
<221> CDS
<222> (6052)..(7131)
<223> tcfD putative fimbrial subunit

<220>
<221> CDS
<222> (7264)..(7719)
<223> tinR putative transcriptional regulator
<400> 1 

tgtaagtatc ccgcataatc gagccattca catttagaga tcatccggca taatcaatct 60 
gccaacgcag gagatcgctg tgcgtaaagc ccgtattact gcgcaccaga tcatcgctgt 120 
gattagatca gttgaatccg gacggactgt taaagatgtc taccgggagg ccggtatttc 180 
tgaagccacc agggacaact ggaagtctgg atacggcggc atggaagctt ctgatattaa 240 
atcttgagga tgtcaacgcc aggatttatg gtcgtttttc gctattttta atatccgctg 300 
tttgatcact tctgctgtcc gctttccgcc atttcatcct cactgattgg cgttgcgctt 360 
ttggtcagcg ccccgacttt gcgtttttcc ttcagttggt aactttctcc tttgatattc 420 
agtgcggtcg agcgatgcag cagccgatcc aggatcgctg ccgccagcac gtcaccaccg 480

aacacctccc cccagtccgc gaagcctctg ctcgacgcca ggacaatgct cgctctttca 540

caccgtcggc ccagcagccg gaagaacaga ccggcttccc cactggccat tggcaggtaa 600 

cccacttcgn ccaggaccag cccccgcgca tagctcagcc grtgcagccg gcgccccagg 660 

C gg CCCtcca gcctcgc:tt catcagcgcc taccaacctg ccctgcggca tgaacaacac 720 

ccgacggccc acatctgccg ccttcacacc gggggcagcg gccaggtggg tttcccccac 780 

cccgggggcc ccagcaggat cacacccccg cagcgctcca cgaacgccag accggccaac 840

tcccggacga ccttacgacc gacgcctggc tggaagctga agtcgaactg ctccagcgct 900 

ccgactcacg gcagacgagc ccgcctcagc cgggactcca ccccgcgctg atgtccgccg 960 

ttccaccccc gctgcagcgc catgcacagg aacgccgact cattctgggc cgacagaggc 1020

at cggccg^t: aagactcgca tacccacgtc gcttacccgc acaaaagggg gaaaggaacc 1080 

gcaaacaaca ttgacaggta agggactggc ancaacnnga atggcat:t::t cracaacac: 1140 

acgttccaaa aggcgaattg tatcctcatg gcgagttgtt ctgcttcaca ccgttgtccg 1200 

cggaacac-t agccaacac: caatctaaaa atagcgtacn acactgacca cacttcgcrg 1260 

gagaggcgta aacatcaagc giaaccatac gatgcacaca agtttttgcc tgctgatacc 1320 

atattcctaa cagatattga gcaagtaaat cngacctaac tgtaacttta gctttctaaa 1380 

ttaatcacct tgctcacttt tttctacctg gttttgtaaa acctcatgat gaccagtcta 1440 

ttcgcggctt tgtttaaggt tcacttggtt tcgtagtctg aatcgcatcc ccctccrgga 1500

gggcttcccc tagatcc:ct gtctcccacg ctgttatgtt acgtarttgt tgctctgaag 1560

gagggggaaa tacagttcca tccatctgag caagtcaggc acacagtaac aac:::c::a 1620 

tgaagaattt ccaaaattt: tactgcggcg ttattaattg ttcagcgatr cttacagatc 1680 

cgtcgttcgc ttttggtgaa cgaaacccgc ggacttttat ttaccaattt tctcttccct 1740 

gaaaaaaaca gaggtactga gcgaaaaatt ctaccccgca cgatgccctc cacacaaaat 1800 

gcattaacac tgaatcgtaa ttcgcttctc cacgctgata accctccgtc tatgccaata 1860

ctaaaactca gacgaccttt atacggtaaa acctggt atg aat etc aaa gat act 1915 

Met Asn Phe Lys Asp Thr 
1 5

ctt ccc ggg gtg ttt c:c cat gtc get atg ttt gca tgt ggt cat gec 1963
Leu Pro Gly Val Phe Leu Cys Val Ala Met Phe Ala Cys Gly His Ala
10 15 20

agg gcg aat atg ccc gtt cat ccc acg gcg gca gaa att aat agt age 2011
Arg Ala Asn Met Leu Val Tyr Pro Met Ala Ala Glu Ile Asn Ser Ser
25 30 35

cgc gaa gag gcc acc teg ctg ttc gec tat tct aaa tea gat cat gtg 2059
Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr Ser Lys Ser Asp His Val
40 45 50

caa tar art cga aca aga ate atg cgt act gaa cac ccc ggt atg cca 2107
Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile Glu His Pro Gly Met Pro
55 60 65 70

cag gaa aag gag gta cca gca ggg aat gat ata gag aca gga etc gtt 2155
Gln Glu Lys Glu Val Pro Ala Gly Asn Asp Ile Glu Thr Gly Leu Val
75 80 85

gtc tec ccg gag aaa ttt get ctt tec ccg gga aca aaa aaa aca ata 2203
Val Ser Pro Glu Lys Phe Ala Leu Ser Pro Gly Thr Lys Lys Thr Ile
90 95 100

cgt gtt ate agt act cag gca ccg gaa aga gag gaa gcc tag egg gta 2251
Arg Val Ile Ser Thr Gln Ala Pro Glu Arg Glu Glu Ala Trp Arg Val
105 110 115

t:ac etc aag get gtt cct gaa ctg gaa gat gat cca cag gca ggc gga 2299
Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp Asp Pro Gln Ala Gly Gly
120 125 130

aag caa aat tea tec gta agt gtg aat etc gtc tgg ggg gtg ttg ctg 2347
Lys Gln Asn Ser Ser Val Ser Val Asn Leu Val Trp Gly Val Leu Leu
135 140 145 150

cgt gtt tct ccg tea gac ccc agg cct gcg ctg gca acg gac ggt cac 2395
Arg Val Ser Pro Ser Asp Pro Arg Pro Ala Leu Val Thr Asp Gly His
155 160 165

cac ctg ctg aat acg gga aac aca egg ctt tct ctt att egg get ggc 2443
His Leu Leu Asn Thr Gly Asn Thr Arg Leu Ser Leu Ile Arg Ala Gly
170 175 180

aac tgc gac acc aca tgc cac tgg cag aat ata ggc aaa agt att tat 2491
Asn Cys Asp Thr Thr Cys His Trp Gln Asn Ile Gly Lys Ser Ile Tyr
185 190 195

ccc acc ggg age get gat att ccg gcc gga ata aaa agt aat gca ttt 2539
Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly Ile Lys Ser Asn Ala Phe
200 205 210

cgt gtg gaa tat cgt acg ggt gca aat tea ccg gta ate tct get gat 2587
Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser Pro Val Ile Ser Ala Asp
215 220 225 230

tta aca gca gcc gga aag taa aaacacaegg agegtacget ataccctaca 2638
Leu Thr Ala Ala Gly Lys
235

tttactctca gggggagegg atg tat acc gag tgt aca tat ate act gta ata 2691
Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile
240 245



aac aac aaa gca agg tta ttt ttt atg aac atg aaa aca tct ttt att 2739
Asn Asn Lys Ala Arg Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile
250 255 260

gcc gca get gtg gca teg gec ace get tat tct ttt tct gtt tct gcg 2787
Ala Ala Ala Val Ala Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala
265 270 275 280

gtt cag aag gat ate acc gec acc gcc aac act gac agt aca cee gaa 2835
Val Gln Lys Asp Ile Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu
285 290 295

ctg ctg cag gcc gac ggt tea tec etc ccg teg acc atg aag ctg gac 2883
Leu Leu Gln Ala Asp Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp
300 305 310

tec atg ccg ggt aag ggc ctg gec cat aaa tea etc cag acc cgc etc 2931
Phe Met Pro Gly Lys Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu
315 320 325

tac age aac gac cag acc aag teg gtt aat gta aaa ctg ttg aat gcc 2979
Tyr Ser Asn Asp Gln Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala
330 335 340

cca caa ccc ate aac gtc ccg gac ccc acc aaa acc acc gat atg gaa 3027
Pro Gln Leu Ile Asn Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu
345 350 355 360

gcg act ctg gga gga egg tea ccg acc acc acc aac ccc gta ctg gaa 3075
Val Thr Leu Gly Gly Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu
365 370 375

get aaa acc ccg ccc ccg gac gga aaa acc ggc gac gcc tea get ctg 3123
Ala Lys Thr Leu Phe Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu
380 385 390

ctg aac ctg gac att ggt cag aag gcc gga gca gcc cca caa aac ccg 3171
Leu Asn Leu Asp Ile Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu
395 400 405

cct gcc ggc gaa cac age gga ccg gcc age ccg gcg acc Cca cag gcc 3219
Pro Ala Gly Glu Tyr Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala
410 415 420

gtc act gcc ggc Caa eaactggtta teagcccetc atctgatccg gtettggggg 3274
Val Thr Ala Gly
425

gcaccgcccg tacctgaacc ggatceggea ecgaccccat tattcattgc aactcaggtc 3334

tetttaegtg agtcgttatt tctgg atg tat tat tta ctg gga ttg tgc age 3386
Met Tyr Tyr Leu Leu Gly Leu Cys Ser
430 435



ett acc age cag gca act ctt att ccc cct cct gga ttt gaa tct ccg 

Phe Thr Ser Gin Ala Thr Leu lie Pro Pro Pro Gly Phe Glu Ser Leu 

440 445 450 

ctg gaa gga cag ace gag caa ate gaa gcg ctg eta cca ggg cat tea 

Leu Glu Gly Gin Thr Glu Gin lie Glu Val Leu Leu Pro Gly His Ser 

455 460 465 470 



3434 



3482
ccg gga era tec ccg gtg gcg gtt aaa ccg gac acc gcg cag etc atg 3530
Leu Gly Leu Phe Pro Val Val Val Lys Pro Asp Thr Val Gin Phe Met 

475 480 485 

tec cca ttg atg gca etc gaa age age ggg ctt gec gcg ttg ccg gec 3578 
Ser Pro Leu Met Val Leu Glu Ser Ser Gly Leu Ala Ala Leu Pro Ala 
490 495 500 

gca gaa egg caa aaa gcg ctg get gca etc age cgt ccg ttg eta cgt 3626 
Ala Glu Arg Gin Lys Ala Leu Ala Ala Leu Ser Arg Pro Leu Leu Arg 
505 S10 515 

aac age aat ctg gtc tgt ggt gtc tea gaa gca aaa gac age age gag 3674 
Asn Ser Asn Leu Val Cys Gly Val Ser Glu Ala Lys Asp Ser Ser Glu 
520 525 530 

tgt ggt tac gcg gca aca gat aaa gag gat get gcg gtt att ttt gat 3722 
Cys Gly Tyr Val Ala Thr Asp Lys Glu Asp Val Ala Val lie Phe Asp 
535 540 545 550 

gag aac aac get cag tea tct ttg ttt ctt aac egg gac tgg ttg ccg 3770 
Glu Asn Asn Ala Gin Leu Ser Leu Phe Leu Asn Arg Asp Tr? Leu Pro 

555 560 565 

gat gaa gaa cga cgt gat aaa cgc tgg ctg act ccg acc ccg gag ggt 
Asp Glu Glu Arg Arg Asp Lys Arg Trp Leu Thr Pro Thr Pro Glu Gly 
570 575 580 

gtc age gca ttt att cac cgc cag acg ctg tat ctg agt gat gat etc 
Val Ser Ala Phe lie His Arg Gin Thr Leu Tyr Leu Ser Asp Asp Leu 
585 590 595 

cac age cgt aat atg aca ctg aat ggt age ggt gec ctg ggg ctt ggt 3 914 
His Ser Arg Asn Met Thr Leu Asn Gly Ser Gly Ala Leu Gly Leu Gly 
600 605 610 

gac ggt cgt tat ctg gga ggc gac tgg gcg get ate tgg aat cag tea 3 962 
Asp Gly Arg Tvr Leu Gly Gly Asp Trp Ala Ala He Trp Asn Gin Ser 

620 * 625 630 



3819 



3866 



615 



4010 



gaa cat tac aat aac agt cag gec tgg ttt gac aat ctg ttt gec cgt 
Glu His Tyr Asn Asn Ser Gin Ala Trp Phe Asp Asn Leu Phe Val Arg 

635 640 645 

cag gat etc ggc aat cag tat tat etc cag get ggt egg atg gat cag 
Gin Asp Leu Gly Asn Gin Tyr Tyr Leu Gin Ala Gly Arg Met: Asp Gin 
650 655 660 

egg aac ctg tec age gec acg ggg ggg gat ttt ggg ttc agt ctg ctt 
Arg Asn Leu Ser Ser Ala Thr Gly Gly Asp Phe Gly Phe Ser Leu Leu 
665 670 675 

ccc ccg age egg ttt gat gga tta cga acc ggg acc acc caa get tat 4154 
Pro Leu Ser Arg Phe Asp Gly Leu Arg Thr Gly Thr Thr Gin Ala Tyr 
680 685 690 



4058 



4106 



gtc aac cat gag gtg gac cat aat gee act ccg gtt acg gee cag gtt 
Val Asn His Glu Val Asp His Asn Ala Thr Pro Val Mec Val Gin Val 
695 700 705 710 



4202
acc cga aac gcc cgc att gat acc cat cgt ggc age gag ccg ctg ggg 4250

Thr Arg Asn Ala Arg lie Asp lie Tyr Arg Gly Ser Glu Leu Leu Gly 

715 720 725 

age cag ttc ccg acc ccg gga atg cat acc ccg gac acc cac ccc ccc 

Ser Gin Phe Leu Thr Pro Gly Met His Thr Leu Asp Thr His Ser Leu 
730 735 740 

cca ccg gga age cac ccc ccg gcg ccg egg gcg cac gag gac ggg acc 4346 

Pro Pro Gly Ser Tyr Pro Leu Ala Leu Arg Val Tyr Glu Asp Gly lie 
745 750 755 

ccg egg cga acg gag acc cag ccc etc age aag ggg ggc aac age ccc 4 3 94 

Leu Arg Arg Thr Glu Thr Gin Pro Phe Ser Lys Gly Gly Asn Ser Phe 
760 765 770 



4490 



4538 



4586 



age gca cag acc cag egg tec acc cag ggc ggg ccg gaa gac acc ggg 4442 
Ser Ala Gin Thr Gin Trp Phe lie Gin Gly Gly Leu Glu Asp Thr Gly 
775 780 785 790 

gac aaa ccc age cac tat gac ggc gag acc gcc atg gcc gcc gga ccc 
Asp Lys Ala Ser His Tyr Asp Gly Glu Thr Val Met Ala Ala Gly Phe 

795 800 805 

caa acc ggg ccg egg aaa aac acc age ccg acc gaa ggc acc ccc ccg 
Gin Thr Gly Leu Arg Lys Asn lie Ser Leu Thr Glu Gly lie Ser Leu 
810 815 820 

gca cac gag gcc egg cac age gaa acc cga ccg aac cca cag cac gca 
Ala His Glu Ala Trp Tyr Ser Glu Thr Arg Leu Asn Ser Gin His Ala 
825 830 835 

gcg ccg gac ggc acg ccg gac ccc ccc gcc ggg aca ccg cac ggg aca 4634 
Val Leu Asp Gly Thr Leu Asp Leu Ser Ala Gly lie Leu His Gly Thr 
840 845 850 

gac age acg age ggt aac act gag cag gcg aca cac aac gac gga CCC 4 682 
Asp Ser Thr Ser Gly Asn Thr Glu Gin Val Thr Tyr Asn Asp Gly Phe 
855 " 860 865 870 

ccc gcg age ccg egg cgc aac cac acg gaa age gac gcc tgc age ggc 
Ser Ala Ser Leu Trp Arg Asn His Thr Glu Ser Asp Ala Cys Ser Gly 

875 880 885 

cgc cac cca cag cca gcg cac gcc agt atg acc tgc cag act teg atg 
Arg His Pro Gin Ser Val His Ala Ser Met Thr Cys Gin Thr Ser Met 
890 895 900 

aac gcc ccc ccg ccg gcc teg gcg ggg aac tgg cac gcc cca ccg gga 
Asn Ala Ser Leu Ser Val Ser Val Gly Asn Trp Tyr Ala Leu Leu Gly 
90S 910 915 

cac age acc age agg aca gaa ggc egg ccg get tac egg gga tac gac 4 874 
Tyr Ser Thr Ser Arg Thr Glu Gly Arg Pro Val Tyr Arg Gly Tyr Asp 
920 925 930 



4730 



4778 



4826 



gat aac age gac aaa gaa aac gcg ccc egg cga cag gca tac ate ccc 
Asp Asn Ser Asp Lys Glu Asn Val Phe Trp Arg Gin Ala Tyr lie Pro 
935 * 940 945 950 



4922
gcc tct cac cgc gaa cct get cag get age gca acg tac age ctt aac

Ala Ser His Arg Glu Ser Ala Gin Ala Ser Ala Thr Tyr Ser Leu Asn 

955 960 965 

atg gen ggc acg aac act aat ace cat ggg gga gca tgg cga acc cga 

Mec Ala Gly Met Asn lie Asn Thr His Gly Gly Val Trp Arg Thr Arg 
970 975 980 



tat gcc tct caa cca ccg aca atg act ggc agt aat agg tat acc tea 
Tyr Ala Ser Gin Pro Pro Thr Met Thr Gly Ser Asn Arg Tyr Thr Ser 
1000 1005 1010 



4970 



5018 



aac gac gga gcg aat gat gat ggc ttg ttt atg agt gtc agt gcg tea 5066 
Asn Asp Gly Val Asn Asp Asp Gly Leu Phe Met Ser Val Ser Val Ser 
98S 990 995 



5114 



gcc ggg acc gat act cac agt age egg aat caa aaa aca cag acg tec 
Ala Gly Thr Asp He His Ser Ser Arg Asn Gin Lys Thr Gin Thr Ser 
!015 ' * 1020 1025 1030 

egg aac gcg aac cat gtg aga tee egg cag cag gat ctg tac cgc gaa 
Trp Asn Val Asn His Val Arg Ser Trp Gin Gin Asp Leu Tyr Arg Glu 
1035 1040 1045 

ctg teg gcg ggc ttc tec ggt tat aac gac gac age tgg age ggg age 
Leu Ser Val Gly Phe Ser Gly Tyr Asn Asp Asp Ser Trp Ser Gly Ser 
1050 1055 1060 

etc ggc gga cgc atg age ggc cat atg ggt gaa ctg age gcc acc ate 
Leu Gly Gly Arg Met Ser Gly Arg Met Gly Glu Leu Ser Ala Thr He 
1065 1070 1075 

agt aac tec cat caa cgt aat gcg ggc age gcc agt tea etc ace gcc 
Ser Asn Ser His Gin Arg Asn Ala Gly Ser Ala Ser Ser Leu Thr Ala 
1080 1085 1090 

ggc tac age teg tec ctg gcg tta tec cgt aat gga ctg ttc tgg gga 
Gly Tyr Ser Ser Ser Leu Ala Leu Ser Arg Asn Gly Leu Phe Trp Gly 
1095 HOO 1105 HIO 

ggt ggt cag gac ggt gaa ccg gcc tct ggc atg gcg gcg aac gcg gag 
Gly Gly Gin Asp Gly Glu Pro Ala Ser Gly Met Ala Val Asn Val Glu 
1115 1120 H25 

tea gag ggg gac gag ggc agt age ggg aaa gta gtc age gcc cgc ggc 
Ser Glu Gly Asp Glu Gly Ser Ser Gly Lys Val Val Ser Val Arg Gly 
1130 1135 H40 

age age cag ccg tec agt etc ggt ttt ggt cag cag teg ccg ccg ctg 
Ser Ser Gin Pro Phe Ser Leu Gly Phe Gly Gin Gin Ser Leu Leu Leu 
1145 1150 H55 

acg gaa ggc cat aac gcc acg gag gcg acc att gag gat gca ggg gtc 
Mec Glu Gly Tyr Asn Ala Thr Glu Val Thr He Glu Asp Ala Gly Val 
1160 H65 1170 

agt tea cag ggt acg gca ggc gta aaa gcg gga ggg gga age agg tgc 
Ser Ser Gin Gly Mec Ala Gly Val Lys Ala Gly Gly Gly Ser Arg Cys 



5162 



5210 



5258 



5306 



5354 



5402 



5450 



5498 



5546 



5594 



5642
1175



1180 



lias 



1190 



tac etc ctg aca ccc ggg cat ctg ctg get cac aac ate age gec agt 
Tyr Phe Leu Thr Pro Gly His Leu Leu Val His Asn He Ser Ala Ser 

1195 1200 1205 



5690 



atg age cga ctg tac get ggc cgc gta ctg gac aag gat ggc aga ccg 
Met Ser Arg Leu Tyr Val Gly Arg Val Leu Asp Lys Asp Gly Arg Pro 
1210 1215 1220 



5738 



ctg ctg gac gca cag cca ctg aac tat cca ttt ttg teg ttg gga cct 
Leu Leu Asp Ala Gin Pro Leu Asn Tyr Pro Phe Leu Ser Leu Gly Pro 
1225 1230 1235 

tec ggg cga ttt age ctg cag age gag cat aaa gaa tec age ctg tgg 
Ser Gly Arg Phe Ser Leu Gin Ser Glu His Lys Glu Ser Ser Leu Trp 
1240 1245 1250 

ctg ctg tec aaa aac agg ate ctg cgt tgt ccg atg tea gta cat aaa 
Leu Leu Ser Lys Asn Arg He Leu Arg Cys Pro Met Ser Val His Lys 
1255 1260 1265 1270 

cgt egg gat gtt acg cag gta grg ggt gat gtg egg tgt gaa tta agt 
Arg Arg Asp Val Met Gin Val Val Gly Asp Val Arg Cys Glu Leu Ser 

1275 1280 1285 



5786 



5834 



5882 



5930 



gac gtg gat gec ctg cca cag gcg ttg caa ata teg ccg egg gtc ate 
Asp Val Asp Ala Leu Pro Gin Ala Leu Gin lie Ser Pro Arg Val He 
1290 1295 1300 



5978 



cgt ttg ctg aac gtg gca ggt ttg ctg cgc cat tec gtt cag gaa gec 
Arg Leu Leu Asn Val Ala Gly Leu Leu Arg His Ser Val Gin Glu Ala 
1305 1310 1315 



6026 



tga cgtagagata aaggcgttaa ct atg agt aat aaa atg aag tgg acg agt 6078 

Met Ser Asn Lys Met Lys Trp Thr Ser 
1320 1325 



atg aca gec cat tgg tea gca att att aat ttc ate cga aaa tat gtt 

Met Thr Ala His Trp Ser Ala He lie Asn Phe He Arg Lys Tyr Val 

1330 1335 1340 

tat cca gca agg ata att gec ate ctg ctg atg get ggc get aca ctg 

Tyr Pro Ala Arg He He Ala He Leu Leu Met Ala Gly Ala Thr Leu 

1345 1350 1355 1360 

cca caa gtc gec gat gcg att acc gtc gac ctg aat tac gac aag aac 

Pro Gin Val Ala Asp Ala He Thr Val Asp Leu Asn Tyr Asp Lys Asn 

1365 1370 1375 



6126 



6174 



6222 



aat gta gcg gtc ate act cct gtc tgg tec caa gaa tgg agt gta gca 
Asn Val Ala Val He Thr Pro Val Trp Ser Gin Glu Trp Ser Val Ala 
1380 1385 1390 



6270 



aat gtg ttg ggg gga tgg gta tgt cgt tea aac agg aat gaa aat gag 
Asn Val Leu Gly Gly Trp Val Cys Arg Ser Asn Arg Asn Glu Asn Glu 
1395 1400 1405 



6318 



ggg gcg tgt gaa gaa aca cat ttg gta tgg tgg tat get ttt gga get 6366 
Gly Ala Cys Glu Glu Thr His Leu Val Trp Trp Tyr Ala Phe Gly Ala
1410 1415 1420

tat tea aaa att cgt ctg cgt tec aga gaa caa ate age cat gec gaa 
Tyr Ser Lys He Arg Leu Arg Phe Arg Glu Gin He Ser His Ala Glu 
142 5 1430 1435 1440 

att acg etc ata ctg etc ggc agt gtt cgt gat gec tgt tat act ggt 
He Thr Leu He Leu Leu Gly Ser Val Arg Asp Ala Cys Tyr Thr Gly 

1445 1450 14S5 

gtc ate aac atg aac get get gca tgt caa tgg ggt agg teg ctg aaa 
Val He Asn Met Asn Ala Ala Ala Cys Gin Trp Gly Arg Ser Leu Lys 
1460 1465 1470 

ctt agg ata cct tea gaa gag ctt gcg aag ata cct aca age gga aca 
Leu Arg lie Pro Ser Glu Glu Leu Ala Lys lie Pro Thr Ser Gly Thr 
1475 1480 1485 

tgg aaa gca acg tta gtc ctg gat tat tta caa tgg ggc gga gac gat 
Trp Lys Ala Thr Leu Val Leu Asp Tyr Leu Gin Trp Gly Gly Asp Asp 
1490 1495 1S0O 



6414 



?462 



6510 



6558 



6606 



cct tta ggc aca tea ace aca gat ate acg ctg aat gta aca gac cac 6654 
Pro Leu Gly Thr Ser Thr Thr Asp He Thr Leu Asn Val Thr Asp His 
150 5 1510 1515 1520 

ttt: gct gaa aat gcg get att tac ttt ccg caa ttt ggt aca gca acg 6702 
Phe Ala Glu Asn Ala Ala He Tyr Phe Pro Gin Phe Gly Thr Ala Thr 

1525 1530 1535 

ccc egg gtg gac ctg aat ctt cac egg atg aat gec tea caa atg teg 
Pro Arg Val Asp Leu Asn Leu His Arg Met Asn Ala Ser Gin Met Ser 
1540 1545 1550 

ggc agg gct aat ctg gat atg tgt ctg tat gac gga ggt gtg aaa gec 
Gly Arg Ala Asn Leu Asp Met Cys Leu Tyr Asp Gly Gly Val Lys Ala 
1S55 1560 1565 

cgt tea tta cag atg aag ata gaa gga age aat aag tea ggt acg gga 
Arg Ser Leu Gin Met Lys He Glu Gly Ser Asn Lys Ser Gly Thr Gly 
1570 1575 1580 

ttt cag gtt ata aag age gat tct gct gat acg att gat tat gcg gtc 
Phe Gin Val He Lys Ser Asu Ser Ala Asp Thr He Asp Tyr Ala Val 
1585 1590 1595 1*00 

agt atg aat tat ggg gga cga agt att cct gtc ace cgt ggc gtg gag 
Ser Met Asn Tyr Gly Gly Arg Ser He Pro Val Thr Arg Gly Val Glu 
1605 1610 ISl 5 

ttc agt ctg gat aac gtg gat aaa gca gca acg cgt ccg gtg gta ctt 6990 
Phe Ser Leu Asp Asn Val Asp Lys Ala Ala Thr Arg Pro Val Val Leu 
1620 1625 1630 



6750 



6798 



6846 



6894 



6942 



ccc ggg caa egg cag gcg gta cgt tgt gtg cca gtg ccc ctt ace ctg 
Pro Gly Gin Arg Gin Ala Val Arg Cys Val Pro val Pro Leu Thr Leu 
1635 1640 1645 



7038
aca aca caa ccc tct aac ate aga gag aag cgt tec ggc gag cat cag 7086
Thr Thr Gin Pro Phe Asn lie Arg Glu Lys Arg Ser Gly Glu Tyr Gin 
1650 1655 1660 

gga acg ccg aca gcg aca atg ccg atg gga aca caa acc ccc tga 7131 
Gly Thr Leu Thr Val Thr Met Leu Met Gly Thr Gin Thr Pro 
1665 1670 1675 

cagcaaccac ttatccract gatatcttcc eeaeaeggee ttttaaatca gagctctctt 7191 

tatacacccg cttcattcaa caaagagaac ctacccactt atgaaaacca aegegegagg 7251 

ctctgccccc cc acg acc gCg cat eta gat gat aaa gac aaa gaa tea ttg 7302 
Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu 
1680 1685 1690 

aaa gaa acc caa aaa gac tgc gca caa acc CCa egg caa cct gca cat 7350 
Lys Glu lie Gin Lys Asp Cys Ala Gin Thr Leu Trp Gin Leu Ala Tyr 
1695 1700 1705 

aaa gcg gga etc acg ccc aca cca tgc ccc aaa cgc tta aaa aaa ccc 7398 
Lys Val Gly Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu 
1710 1715 1720 

aaa gac agg ggg get ate att ggc cag etc get eta ttg gat aag gaa 7446 
Lys Asp Arg Gly Val He He Gly Gin Phe Ala Leu Leu Asp Lys Glu 
1725 1730 1735 1740 

aaa eta ggt ctt tea ccc aac gtc ttt att atg att aac ata tct gag 7494 
Lys Leu Gly Leu Ser Leu Asn Val Phe He Met lie Asn He Ser Glu 

1745 1750 1755 

gag caa tac get age att tct gag aaa ata aag tea atg cct gag gtt 7542 
Glu Gin Tyr Ala Ser He Ser Glu Lys He Lys Ser Met Pro Glu Val 
1760 1765 1770 

ate gee tct cat cga ate cce gga tea ttt aat tat tea atg cat aca 7590 
He Ala Phe Tyr Arg lie Ser Gly Ser Phe Asn Tyr Leu Met His Thr 
1775 1780 1785 

gta ttt aca gac aCg aac gac cac cat age ttt tat gag aaa aca ata 7638 
Val Phe Thr Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys He He 
1790 1795 1800 

tta act aat tct tea ace age gga tct gca teg age cct gee ctt gag 7686 
Leu Thr Asn Ser Ser lie Ser Gly Ser Ala Ser Ser Phe Val Leu Glu 
1805 1810 1815 1820 

caa ata aag gaa aca aac gaa ctg tea gtg tga aagtgtgatg tgcacccacc 7739 
Gin lie Lys Glu Thr Asn Glu Leu Ser Val 

182S 1830 

gacecaacac accaccatcc eccccacgga acaacaaegg cagattgecg ccgttgaaca 7799 

aggatcccaa ccagcagcgg egaaattaag eggcacagaa taacacagcg gaatatcaca 7859 

tggttaaata tcaccccgcg catgtaacaa aaaacegcat taaaacagat gaegttaceg 7919 

aeatetattt cgttgaaccc ttctggaaaa aaggcgaaaa ccacacaacc gagccaccga 7979

tgttttttga agagttacaa aagtcactca atctatccaa ccataaatat gggttaaata 8039

aacatacact caggatcccc tgggaatttg tgctcataca catggaaagg atcagcaaac 8099 

taaacagcgt cgggttattt gctgtttctg ccgactttaa taacaaccac aaacctctga 8159 

gcgagtacat caggagtcgc agagattatg gtatggaagt ttggtttgat ctctgcggta 8219 

aacactccna ttccagtgaa attaaaaacc ttggattctt ttttcaggct tgcgtagtgc 8279 

ctcgtgatcc taattttatt agtagtgttt atcattatca caagctccaa aagattcttg 8339 

tcggggatat aaacgacgta gaacagaggg ccgtgcacca gaacgaagtt gattacacgc 8399

atggaatgca atggccatcg tcatatgacg gttttttctt tcgggatcat aaaaaaaacg 8459 

aaacttggtg tatataacag aaggagtgaa aatttgaatc aaaaatatct tatctaccct 8519 

ttgttcaatt attgttttgt tttttattac gatcaaatac aaagaacacc attgttcgtg 8579 

cggtggggag gccggaagtt taggggatga ccgcccacca acaaccttat cacagccacc 8639

acacgaatgg tcnatacacg cactagatgt acnaccctag ttnaacacac cgacggtcgc 8699

tacccgcact gatgatgtcc cgctacatta aggaatacac atctgcacct cgctacacgc 8759 

acacccacat tactaatcac tattaatatg agtgtggttc ttgttttacg cacgcatggt 8819 

tgcatgtgac gttaaattca aatgagctga ccgtatgaat tctaaacact ttagagaggt 8879 

gttttttgtc tcggtagttg ttatatcatt attttatttg gtgttatttg cagccagcgc 8939 

tcatgctgaa ggcggtttca gatctggagg cattgggtta tttatgacgg gaacaagaga 8999 

gatgctactg tagagataat aaattctgct aaagattccc caacccttgt gcattgacac 9059 

cctccacgtc ctgaagggcg tgggttcctg ctccaacggg ctgcctgact gcacgctcct 9119 

tccacaggca agcacggcgt gccccgctct aaaacgtcac gcgcgccgtt tacaccggcg 9179 

ttcgcagtat atcttcatac cagacacttig taagtatctc gcataatcgt gccattcaca 9239

tttagagatc atac 9253 

<210> 2 
<211> 236 
<212> PRT 

<213 > Salmonella typhi 
<400> 2 

Met Asn Phe Lys Asp Thr Leu Pro Gly Val Phe Leu Cys Val Ala Met
1 5 10 15

Phe Ala Cys Gly His Ala Arg Ala Asn Met Leu Val Tyr Pro Met Ala
20 25 30

Ala Glu Ile Asn Ser Ser Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr
35 40 45

Ser Lys Ser Asp His Val Gln Tyr Ile Arg Thr Arg Ile Met Arg Ile
50 55 60

Glu His Pro Gly Met Pro Gln Glu Lys Glu Val Pro Ala Gly Asn Asp
65 70 75 80

Ile Glu Thr Gly Leu Val Val Ser Pro Glu Lys Phe Ala Leu Ser Pro
85 90 95

Gly Thr Lys Lys Thr Ile Arg Val Ile Ser Thr Gln Ala Pro Glu Arg
100 105 110

Glu Glu Ala Trp Arg Val Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp
115 120 125

Asp Pro Gln Ala Gly Gly Lys Gln Asn Ser Ser Val Ser Val Asn Leu
130 135 140

Val Trp Gly Val Leu Leu Arg Val Ser Pro Ser Asp Pro Arg Pro Ala
145 150 155 160

Leu Val Thr Asp Gly His His Leu Leu Asn Thr Gly Asn Thr Arg Leu
165 170 175

Ser Leu Ile Arg Ala Gly Asn Cys Asp Thr Thr Cys His Trp Gln Asn
180 185 190

Ile Gly Lys Ser Ile Tyr Pro Gly Gly Ser Ala Asp Ile Pro Ala Gly
195 200 205

Ile Lys Ser Asn Ala Phe Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser
210 215 220

Pro Val Ile Ser Ala Asp Leu Thr Ala Ala Gly Lys
225 230 235



<210> 3 
<211> 191 
<212> PRT 

<213> Salmonella typhi 



<400> 3 

Met Tyr Thr Glu Cys Thr Tyr Ile Thr Val Ile Asn Asn Lys Ala Arg
1 5 10 15

Leu Phe Phe Met Asn Met Lys Thr Ser Phe Ile Ala Ala Ala Val Ala
20 25 30

Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala Val Gln Lys Asp Ile
35 40 45

Thr Val Thr Ala Asn Ile Asp Ser Thr Leu Glu Leu Leu Gln Ala Asp
50 55 60

Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp Phe Met Pro Gly Lys
65 70 75 80

Gly Leu Val His Lys Ser Leu Gln Thr Arg Leu Tyr Ser Asn Asp Gln
85 90 95

Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala Pro Gln Leu Ile Asn
100 105 110

Val Leu Asp Pro Thr Lys Thr Ile Asp Met Glu Val Thr Leu Gly Gly
115 120 125

Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu Ala Lys Thr Leu Phe
130 135 140

Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu Leu Asn Leu Asp Ile
145 150 155 160

Gly Gln Lys Ala Gly Ala Ala Leu Gln Asn Leu Pro Ala Gly Glu Tyr
165 170 175

Ser Gly Leu Val Ser Leu Val Ile Ser Gln Ala Val Thr Ala Gly
180 185 190



<210> 4 
<211> 889 
<212> PRT 

<213> Salmonella typhi 



<400> 4 

Met Tyr Tyr Leu Leu Gly Leu Cys Ser Phe Thr Ser Gln Ala Thr Leu
1 5 10 15

Ile Pro Pro Pro Gly Phe Glu Ser Leu Leu Glu Gly Gln Thr Glu Gln
20 25 30

Ile Glu Val Leu Leu Pro Gly His Ser Leu Gly Leu Phe Pro Val Val
35 40 45

Val Lys Pro Asp Thr Val Gln Phe Met Ser Pro Leu Met Val Leu Glu
50 55 60

Ser Ser Gly Leu Ala Ala Leu Pro Ala Ala Glu Arg Gln Lys Ala Leu
65 70 75 80

Ala Ala Leu Ser Arg Pro Leu Leu Arg Asn Ser Asn Leu Val Cys Gly
85 90 95

Val Ser Glu Ala Lys Asp Ser Ser Glu Cys Gly Tyr Val Ala Thr Asp
100 105 110

Lys Glu Asp Val Ala Val Ile Phe Asp Glu Asn Asn Ala Gln Leu Ser
115 120 125

Leu Phe Leu Asn Arg Asp Trp Leu Pro Asp Glu Glu Arg Arg Asp Lys
130 135 140

Arg Trp Leu Thr Pro Thr Pro Glu Gly Val Ser Ala Phe Ile His Arg
145 150 155 160

Gln Thr Leu Tyr Leu Ser Asp Asp Leu His Ser Arg Asn Met Thr Leu
165 170 175



Asn Gly Ser Gly Ala Leu Gly Leu Gly Asp Gly Arg Tyr Leu Gly Gly
180 185 190

Asp Trp Ala Ala Ile Trp Asn Gln Ser Glu His Tyr Asn Asn Ser Gln
195 200 205

Ala Trp Phe Asp Asn Leu Phe Val Arg Gln Asp Leu Gly Asn Gln Tyr
210 215 220

Tyr Leu Gln Ala Gly Arg Met Asp Gln Arg Asn Leu Ser Ser Ala Thr
225 230 235 240

Gly Gly Asp Phe Gly Phe Ser Leu Leu Pro Leu Ser Arg Phe Asp Gly
245 250 255

Leu Arg Thr Gly Thr Thr Gln Ala Tyr Val Asn His Glu Val Asp His
260 265 270

Asn Ala Thr Pro Val Met Val Gln Val Thr Arg Asn Ala Arg Ile Asp
275 280 285

Ile Tyr Arg Gly Ser Glu Leu Leu Gly Ser Gln Phe Leu Thr Pro Gly
290 295 300

Met His Thr Leu Asp Thr His Ser Leu Pro Pro Gly Ser Tyr Pro Leu
305 310 315 320

Ala Leu Arg Val Tyr Glu Asp Gly Ile Leu Arg Arg Thr Glu Thr Gln
325 330 335

Pro Phe Ser Lys Gly Gly Asn Ser Phe Ser Ala Gln Thr Gln Trp Phe
340 345 350

Ile Gln Gly Gly Leu Glu Asp Thr Gly Asp Lys Ala Ser His Tyr Asp
355 360 365

Gly Glu Thr Val Met Ala Ala Gly Phe Gln Thr Gly Leu Arg Lys Asn
370 375 380

Ile Ser Leu Thr Glu Gly Ile Ser Leu Ala His Glu Ala Trp Tyr Ser
385 390 395 400

Glu Thr Arg Leu Asn Ser Gln His Ala Val Leu Asp Gly Thr Leu Asp
405 410 415

Leu Ser Ala Gly Ile Leu His Gly Thr Asp Ser Thr Ser Gly Asn Thr
420 425 430

Glu Gln Val Thr Tyr Asn Asp Gly Phe Ser Ala Ser Leu Trp Arg Asn
435 440 445

His Thr Glu Ser Asp Ala Cys Ser Gly Arg His Pro Gln Ser Val His
450 455 460

Ala Ser Met Thr Cys Gln Thr Ser Met Asn Ala Ser Leu Ser Val Ser
465 470 475 480

Val Gly Asn Trp Tyr Ala Leu Leu Gly Tyr Ser Thr Ser Arg Thr Glu
485 490 495

Gly Arg Pro Val Tyr Arg Gly Tyr Asp Asp Asn Ser Asp Lys Glu Asn
500 505 510

Val Phe Trp Arg Gln Ala Tyr Ile Pro Ala Ser His Arg Glu Ser Ala
515 520 525

Gln Ala Ser Ala Thr Tyr Ser Leu Asn Met Ala Gly Met Asn Ile Asn
530 535 540

Thr His Gly Gly Val Trp Arg Thr Arg Asn Asp Gly Val Asn Asp Asp
545 550 555 560

Gly Leu Phe Met Ser Val Ser Val Ser Tyr Ala Ser Gln Pro Pro Thr
565 570 575

Met Thr Gly Ser Asn Arg Tyr Thr Ser Ala Gly Thr Asp Ile His Ser
580 585 590

Ser Arg Asn Gln Lys Thr Gln Thr Ser Trp Asn Val Asn His Val Arg
595 600 605

Ser Trp Gln Gln Asp Leu Tyr Arg Glu Leu Ser Val Gly Phe Ser Gly
610 615 620

Tyr Asn Asp Asp Ser Trp Ser Gly Ser Leu Gly Gly Arg Met Ser Gly
625 630 635 640

Arg Met Gly Glu Leu Ser Ala Thr Ile Ser Asn Ser His Gln Arg Asn
645 650 655

Ala Gly Ser Ala Ser Ser Leu Thr Ala Gly Tyr Ser Ser Ser Leu Ala
660 665 670

Leu Ser Arg Asn Gly Leu Phe Trp Gly Gly Gly Gln Asp Gly Glu Pro
675 680 685

Ala Ser Gly Met Ala Val Asn Val Glu Ser Glu Gly Asp Glu Gly Ser
690 695 700

Ser Gly Lys Val Val Ser Val Arg Gly Ser Ser Gln Pro Phe Ser Leu
705 710 715 720

Gly Phe Gly Gln Gln Ser Leu Leu Leu Met Glu Gly Tyr Asn Ala Thr
725 730 735

Glu Val Thr Ile Glu Asp Ala Gly Val Ser Ser Gln Gly Met Ala Gly
740 745 750

Val Lys Ala Gly Gly Gly Ser Arg Cys Tyr Phe Leu Thr Pro Gly His
755 760 765

Leu Leu Val His Asn Ile Ser Ala Ser Met Ser Arg Leu Tyr Val Gly
770 775 780

Arg Val Leu Asp Lys Asp Gly Arg Pro Leu Leu Asp Ala Gln Pro Leu
785 790 795 800

Asn Tyr Pro Phe Leu Ser Leu Gly Pro Ser Gly Arg Phe Ser Leu Gln
805 810 815

Ser Glu His Lys Glu Ser Ser Leu Trp Leu Leu Ser Lys Asn Arg Ile
820 825 830

Leu Arg Cys Pro Met Ser Val His Lys Arg Arg Asp Val Met Gln Val
835 840 845

Val Gly Asp Val Arg Cys Glu Leu Ser Asp Val Asp Ala Leu Pro Gln
850 855 860

Ala Leu Gln Ile Ser Pro Arg Val Ile Arg Leu Leu Asn Val Ala Gly
865 870 875 880

Leu Leu Arg His Ser Val Gln Glu Ala
885



<210> 5 
<211> 359 
<212> PRT 

<213> Salmonella typhi 



<400> 5 

Met Ser Asn Lys Met Lys Trp Thr Ser Met Thr Ala His Trp Ser Ala
1 5 10 15

Ile Ile Asn Phe Ile Arg Lys Tyr Val Tyr Pro Ala Arg Ile Ile Ala
20 25 30

Ile Leu Leu Met Ala Gly Ala Thr Leu Pro Gln Val Ala Asp Ala Ile
35 40 45

Thr Val Asp Leu Asn Tyr Asp Lys Asn Asn Val Ala Val Ile Thr Pro
50 55 60

Val Trp Ser Gln Glu Trp Ser Val Ala Asn Val Leu Gly Gly Trp Val
65 70 75 80

Cys Arg Ser Asn Arg Asn Glu Asn Glu Gly Ala Cys Glu Glu Thr His
85 90 95

Leu Val Trp Trp Tyr Ala Phe Gly Ala Tyr Ser Lys Ile Arg Leu Arg
100 105 110

Phe Arg Glu Gln Ile Ser His Ala Glu Ile Thr Leu Ile Leu Leu Gly
115 120 125

Ser Val Arg Asp Ala Cys Tyr Thr Gly Val Ile Asn Met Asn Ala Ala
130 135 140

Ala Cys Gln Trp Gly Arg Ser Leu Lys Leu Arg Ile Pro Ser Glu Glu
145 150 155 160

Leu Ala Lys Ile Pro Thr Ser Gly Thr Trp Lys Ala Thr Leu Val Leu
165 170 175

Asp Tyr Leu Gln Trp Gly Gly Asp Asp Pro Leu Gly Thr Ser Thr Thr
180 185 190

Asp Ile Thr Leu Asn Val Thr Asp His Phe Ala Glu Asn Ala Ala Ile
195 200 205

Tyr Phe Pro Gln Phe Gly Thr Ala Thr Pro Arg Val Asp Leu Asn Leu
210 215 220

His Arg Met Asn Ala Ser Gln Met Ser Gly Arg Ala Asn Leu Asp Met
225 230 235 240

Cys Leu Tyr Asp Gly Gly Val Lys Ala Arg Ser Leu Gln Met Lys Ile
245 250 255

Glu Gly Ser Asn Lys Ser Gly Thr Gly Phe Gln Val Ile Lys Ser Asp
260 265 270

Ser Ala Asp Thr Ile Asp Tyr Ala Val Ser Met Asn Tyr Gly Gly Arg
275 280 285

Ser Ile Pro Val Thr Arg Gly Val Glu Phe Ser Leu Asp Asn Val Asp
290 295 300

Lys Ala Ala Thr Arg Pro Val Val Leu Pro Gly Gln Arg Gln Ala Val
305 310 315 320

Arg Cys Val Pro Val Pro Leu Thr Leu Thr Thr Gln Pro Phe Asn Ile
325 330 335

Arg Glu Lys Arg Ser Gly Glu Tyr Gln Gly Thr Leu Thr Val Thr Met
340 345 350

Leu Met Gly Thr Gln Thr Pro
355



<210> 6 
<211> 151 
<212> PRT 

<213> Salmonella typhi 
<400> 6 

Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu Lys Glu Ile
1 5 10 15

Gln Lys Asp Cys Ala Gln Thr Leu Trp Gln Leu Ala Tyr Lys Val Gly
20 25 30

Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu Lys Asp Arg
35 40 45

Gly Val Ile Ile Gly Gln Phe Ala Leu Leu Asp Lys Glu Lys Leu Gly
50 55 60

Leu Ser Leu Asn Val Phe Ile Met Ile Asn Ile Ser Glu Glu Gln Tyr
65 70 75 80

Ala Ser Ile Ser Glu Lys Ile Lys Ser Met Pro Glu Val Ile Ala Phe
85 90 95

Tyr Arg Ile Ser Gly Ser Phe Asn Tyr Leu Met His Thr Val Phe Thr
100 105 110

Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys Ile Ile Leu Thr Asn
115 120 125

Ser Ser Ile Ser Gly Ser Ala Ser Ser Phe Val Leu Glu Gln Ile Lys
130 135 140

Glu Thr Asn Glu Leu Ser Val
145 150